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Abstract 

We investigate a new model for populations evolving in a spatial continuum. 
This model can be thought of as a spatial version of the A-Fleming-Viot pro- 
cess. It explicitly incorporates both small scale reproduction events and large scale 
extinction-recolonisation events. The lineages ancestral to a sample from a popula- 
tion evolving according to this model can be described in terms of a spatial version 
of the A-coalescent. Using a technique of Evans (1997), we prove existence and 
uniqueness in law for the model. We then investigate the asymptotic behaviour of 
the genealogy of a finite number of individuals sampled uniformly at random (or 
more generally 'far enough apart') from a two-dimensional torus of sidelength L 
as L — >■ oo. Under appropriate conditions (and on a suitable timescale) we can 
obtain as limiting genealogical processes a Kingman coalescent, a more general A- 
coalescent or a system of coalescing Brownian motions (with a non-local coalescence 
mechanism) . 
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1 Introduction 



In 1982, Kingman introduced a process called the coalescent. This process provides 
a simple and elegant description of the genealogical (family) relationships amongst a 
set of neutral genes in a randomly mating (biologists would say panmictic) population 
of constant size. Since that time, spurred on by the flood of DNA sequence data, 
considerable effort has been spent extending Kingman's coalescent to incorporate things 
like varying population size, natural selection and spatial (and genetic) structure of 
populations. Analytic results for these coalescent models can be very hard to obtain, 
but it is relatively easy, at least in principle, to simulate them and so they have become 
fundamental tools in sequence analysis. However, models of spatial structure have 
largely concentrated on subdivided populations and a satisfactory model for the ancestry 
of a population evolving in a two-dimensional spatial continuum has remained elusive. 
Our aim in this paper is to present the first rigorous investigation of a new model that 
addresses some of the difficulties of existing models for spatially extended populations 
while retaining some analytic tractability. The rest of this introduction is devoted to 
placing this research in context. The reader eager to skip straight to the model and a 
precise statement of our main results should proceed directly to Section [2j 

Our concern here is with the extension of the coalescent to spatially structured 
populations. In this setting it is customary to assume that the population is subdivided 
into denies of (large) constant size, each situated at a vertex of a graph G, and model 
the genealogical trees using the structured coalescent. As we trace backwards in time, 
within each deme the ancestral lineages follow Kingman's coalescent, that is each pair 
of lineages merges (or coalesces) into a single lineage at a constant rate, but in addition 
lineages can migrate between demes according to a random walk on the graph G. The 
genealogical trees obtained in this way coincide with those for a population whose 
forwards in time dynamics are given by Kimura's stepping stone model (Kimura 1953) 
or, as a special case, if G is a complete graph, by Wright's island model (Wright 1931). 

The stepping stone model is most easily described when the population consists of 
individuals of just two types, a and A say. It can be extended to incorporate selection, 
but let us suppose for simplicity that these types are selectively neutral. Labelling the 
vertices of the graph G by the elements of the (finite or countable) set / and writing pi 
for the proportion of individuals in deme i of type a, say, we have 

dp^{t) = (Pj(^) - Pii^)) + VWi{t){'^-P^it))dWi{t) (1) 

where {Wi{t);t > 0}jg/ is a collection of independent Wiener processes, 7 is a positive 
constant and {mjjjjjg/ specifies the rates of a continuous time random walk on G. 
The graph G, chosen to caricature the spatial structure of the population, is typically 
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taken to be 1? (or its intersection with a two-dimensional torus) and then one sets 
ruij = corresponding to simple random walk. 

Although the stepping stone model is widely accepted as a model for structured pop- 
ulations, in reality, many populations are not subdivided, but instead are distributed 
across a spatial continuum. Wright (1943) and Malecot (1948) derived expressions for 
the probability of identity of two individuals sampled from a population dispersed in a 
two-dimensional continuum by assuming on the one hand that genes reproduce and dis- 
perse independently of one another, and on the other hand that they are scattered in a 
stationary Poisson distribution. However, these assumptions are incompatible (Felsen- 
stein 1975, Sawyer & Fleischmann 1979). The assumption of independent reproduction 
will result in 'clumping' of the population and some local regulation will be required to 
control the local population density. 

A closely related approach is to assume that the genealogical trees can be constructed 
from Brownian motions which coalesce at an instantaneous rate given by a function 
of their separation. The position of the common ancestor is typically taken to be a 
Gaussian centred on the midpoint between the two lineages immediately before the 
coalescence event (although other distributions are of course possible). However, the 
coalescent obtained in this way does not exhibit sampling consistency. That is, if we 
construct the genealogical tree corresponding to a sample of size n and then examine 
the induced genealogical tree for a randomly chosen subsample of size k < n, this 
will not have the same distribution as the tree we obtain by constructing a system of 
coalescing lineages directly from the subsample. The reason is that whenever one of the 
lineages in the subsample is involved in a coalescence event in the full tree it will jump. 
Furthermore, just as in Malecot's setting, there is no corresponding forwards in time 
model for the evolution of the population. 

Barton et al. (2002) extend the formulae of Wright and Malecot to population mod- 
els which incorporate local strTicture. The probability of identity is obtained from a 
recursion over timeslices of length At. Two related assumptions are made. First, the 
ancestral lineages of genes that are sufficiently well separated are assumed to follow in- 
dependent Brownian motions (with an effective dispersal rate which will in general differ 
from the forwards in time dispersal rate) and their chance of coancestry in the previous 
timeslice is negligible. Second, it must be possible to choose At sufficiently large that 
the changes in the population over successive timeslices are uncorrelated. (For general 
At this will not be the case. The movements of ancestral lineages in one time step may 
be correlated with their movements in previous steps if, for example, individuals tend 
to disperse away from temporarily crowded clusters.) Over all but very small scales, the 
resulting probability of identity can be written as a function of three parameters: the 
effective dispersal rate, the neighbourhood size and the local scale. However the useful- 
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ness of this result is limited due to a lack of explicit models for which the assumptions 
can be validated and the effective parameters established. Moreover, as explained in 
Barton et al. (2002), although one can in principle extend the formula to approximate 
the distribution of genealogies amongst larger samples of well-separated genes, addi- 
tional assumptions need to be made if such genealogies are to be dominated by pairwise 
coalescence. If several genes are sampled from one location and neighbourhood size is 
small then multiple coalescence (by which we mean simultaneous coalescence of three or 
more lineages) could become significant. 

Multiple merger coalescents have received considerable attention from mathemati- 
cians over the last decade. Pitman (1999) and Sagitov (1999) introduced what we now 
call K- coalescents^ in which more than two ancestral lineages can coalesce in a single 
event, but simultaneous coalescence events are not allowed. Like Kingman's coalcsccnt, 
these processes take their values among partitions of N and their laws can be prescribed 
by specifying the restriction to partitions of {1, 2, . . . , n} for each n G N. For our pur- 
poses, the A-coalescent describes the ancestry of a population whose individuals are 
labelled by N. Each block in the partition at time t corresponds to a single ancestor at 
time t before the present, with the elements of the block being the descendants of that 
ancestor. Tracing backwards in time, the evolution of the A-coalescent is as follows: if 
there are currently p ancestral lineages, then each transition involving j of the blocks 
merging into one happens at rate 



and these are the only possible transitions. Here, A is a finite measure on [0, 1]. King- 
man's coalescent corresponds to the special case A = the point mass at the origin. 

Remark 1.1 More generally, one can consider processes with simultaneous multiple co- 
alescence events. Such coalescents were obtained as the genealogies of suitably rescaled 
population models by Mohle & Sagitov (2001). Independently, Schweinsberg (2000) ob- 
tained the same class of coalescents and characterised the possible rates of mergers in 
terms of a single measure S on an infinite simplex. Coalescents which allow simultane- 
ous multiple mergers are now generally referred to as S-coalescents. 

Kingman's coalescent can be thought of as describing the genealogy of a random sample 
from a Fleming- Viot process. In the same way, a A-coalescent describes the genealogy of 
a random sample from a generalised Fleming- Viot process. This process takes its values 
among probability measures on [0, 1]. We shall describe it in terms of its generator, Tl 
acting on functions of the form 




(2) 
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where p G N and / : [0, 1]^ — t- M is measurable and bounded. First we need some 
notation. If x = (xi, . . . , Xp) £ [0, 1]^ and J C {1, . . . ,p} we write 

xf = Xmin J if i € J, and x/ = Xj if i ^ J, i = 1, . . . ,p. 

Then for A a finite measure on [0, 1], a A-Fleming-Viot process has generator 

JC{l,...,p},|J|>2 

where (3^^ is defined in Equation When A({0}) = 0, this can also be written 
nF{p) = [ [ (f({1 - u)p + u6k) - F{p))p{dk)u-^Aidu). 

i(0,l] J[0,1] ^ ' 

(When A({0}) > 0, one must add a second term corresponding to a classical Fleming- 
Viot process and somehow dual to the Kingman part of the A-coalescent.) In this case, 
an intuitive way to think about the process is to consider a Poisson point process on 
M+ X (0, 1] with intensity measure dt ® u~'^A{du), which picks jump times and sizes for 
p{t). At a jump time t with corresponding jump size n, a type k is chosen according to 
p{t—), an atom of mass u is inserted at k and p{t—) is scaled down by (1 — u) so that 
the total mass remains equal to one, i.e., 

p{t) = {l-u)p{t-) + u6k. (3) 

The duality between A-coalescents and A-Fleming-Viot processes was first proved by 
Bertoin & Le Gall (2003). Their approach uses a correspondence between the A- 
coalescents and stochastic flows of bridges. The duality can also be understood via 
the Donnelly & Kurtz (1999) 'modified lookdown construction' and indeed is implicit 
there. An explicit explanation can be found in Birkner et al. (2005). 

In recent work (described briefiy in Etheridge 2008), Barton & Etheridge have pro- 
posed a new class of consistent forwards and backwards in time models for the evolu- 
tion of allele frequencies in a population distributed in a two-dimensional (or indeed 
d-dimensional) spatial continuum which, in the simplest setting, can be thought of as 
spatial versions of the A-Fleming-Viot and A-coalescent models (although we empha- 
size that these are not the same as the spatial A-coalescents considered by Limic & 
Sturm 2006). They share many of the advantages of the classical models for spatially 
structured populations while overcoming at least some of the disadvantages. The idea 
is simple. Just as in the A-Fleming-Viot process, reproduction events are determined 
by a Poisson point process but now, in addition to specifying a time and a value u, this 
process prescribes a region of space which will be affected by the event. In what follows, 
the region will be a ball with random centre and radius. Within that region the effect 
is entirely analogous to Equation ([3]). 
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This approach differs from existing spatial models in three key ways. First, density 
dependent reproduction is achieved by basing reproduction events on neighbourhoods 
(whose locations are determined by the Poisson point process), rather than on individ- 
uals. Second, the offspring of a single individual can form a significant proportion of 
the population in a neighbourhood about the parent, capturing the essentially finite 
nature of the local population size. Third, large scale extinction-recolonisation events 
are explicitly incorporated. This reflects the large scale fluctuations experienced by real 
populations in which the movement and reproductive success of many individuals are 
correlated. For example, climate change has caused extreme extinction and recolonisa- 
tion events that dominate the demographic history of humans and other species (e.g. 
Filer et al. 2004). 

The spatial A-Fleming-Viot process, like its classical counterpart, can be obtained as 
a limit of individual based models. Those prelimiting models are discussed in Berestycki 
et al. (2009). In the (backwards in time) spatial A-coalescent, ancestral lineages move 
around according to dependent Levy processes (in fact they will be compound Poisson 
processes), jumping whenever they are affected by a reproduction event. Two or more 
lineages can coalesce if they are all affected by the same reproduction event. 

Our first aim here is to provide a precise mathematical description of the spatial 
A-Fleming-Viot process and the corresponding spatial A-coalescent model and address 
questions of existence and uniqueness. This is achieved through adapting the work of 
Fvans (1997). The idea is to first construct the dual (backwards in time) process of 
coalescing Levy processes corresponding to a finite sample from the population at time 
zero, and then to use a functional duality to define the forwards in time model. The 
principal difference between our setting and that of Fvans is that, in his work, ancestral 
lineages evolve independently until they meet. 

The system of coalescing Levy processes that describes the genealogy of a sample 
from the population, mirrors the system of coalescing random walks that plays the same 
role for the stepping stone model. For systems of coalescing walks a number of studies 
have investigated conditions under which, when viewed on an appropriate timescale, 
and for sufficiently well-separated samples, the effect of the geographical structure of 
the population can be summarised as a single 'effective' parameter and the system of 
coalescing lineages converges to Kingman's coalescent. One of the first works along these 
lines is due to Cox (1989), who considers random walks on a torus T{L)r\Z,'^ of sidelength 
L with the walks coalescing instantly on meeting. This corresponds to taking G = 
T(L) nZ"^ and 7 = cxd in Fquation ([T|). He shows that if one starts walks from any finite 
number n G N of points chosen independently and uniformly at random from T(L)nZ'^, 
then in suitable time units, as L — )• 00, the number of surviving lineages is determined 
by Kingman's coalescent. For two spatial dimensions, this analysis was extended by Cox 
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& Durrett (2002) and Ziihle et al. (2005) to random walks on T(L) n 1? with delayed 
coalescence (corresponding to 7 < oo). It is natural to ask whether similar results are 
true here. Our second aim then is to establish conditions under which the genealogy of a 
sample taken at random from a large torus will converge to a non-spatial coalescent. We 
shall concentrate on the most difficult, but also most biologically relevant, case of two 
spatial dimensions. If reproduction events only affect bounded neighbourhoods, then, 
not surprisingly, we recover a Kingman coalescent limit. However, we also consider the 
more general situation in which in addition to 'small' events that affect only bounded 
neighbourhoods we allow 'large' extinction-recolonisation events (see Section [3] for the 
precise setting). Unless these events affect a non-negligible proportion of the torus, on 
a suitable timescale, asymptotically we once again recover a Kingman coalescent. The 
timescale is determined by the relative rates of 'large' and 'small' events. However, if we 
have extinction-recolonisation events that affect regions with sidelength of order 0{L), 
then, again depending on the relative rates of 'large' and 'small' events, we can obtain a 
more general (non-spatial) A-coalescentlim.it or a system of coalescing Brownian motions 
(where the coalescence is non-local). 

The rest of the paper is laid out as follows. In Section [2] we define the model. 
In Section [HJ we give a precise statement of the conditions under which we obtain 
convergence of the genealogy of a random sample from a (two-dimensional) torus of side 
L as L — 7- 00. The corresponding convergence results are Theorem 13.31 and Theorem 13. 71 
In Section [5] we establish existence of the process and prove uniqueness in law. In 
Section [5] we gather the necessary results on Levy processes in preparation for our 
proofs of Theorem 13.31 and Theorem 13.71 in Sections [6] and [71 Finally, Appendices [X] and 
IB] contain the proofs of the technical lemmas stated in Sections [5] and [6l 

2 The model 

First we describe a prelimiting model. Individuals in our population are assumed to 
have a type taken from [0, 1] and a spatial position in a metric space E that we shall 
usually take to be (or the torus T(L) in M^). Even though it will be clear that 
existence and uniqueness of the process holds in much greater generality, the model 
is primarily motivated by considerations for populations evolving in two-dimensional 
continua. The dynamics are driven by a Poisson point process H on x x (0, 00) 
with intensity dt®dx®ii{dr). If {t,x,r) G H, the first component represents the time of 
a reproduction event. The event will affect only individuals in B{x, r), the closed ball of 
centre x and radius r. We require two more ingredients. The first, m, is a fixed positive 
constant which we shall refer to as the intensity of the model. Second, associated to 
each fixed radius r > there is a probability measure I'r on [0, 1]. In the sequel, we 
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assume that the mapping r i-> i/^ is measurable with respect to fi. 

For definiteness, suppose that the population is initially distributed according to 
a spatially homogeneous Poisson process. The dynamics of our prelimiting model are 
described as follows. Suppose that {t,x,r) G 11. Consider the population in B{x,r) at 
time t—. If the ball is empty, then nothing happens. Otherwise, independently for each 
event: 

1. Select a 'parent' uniformly at random from those individuals in B{x,r) at time 
t— and sample u G [0, 1] at random according to Ur. 

2. Each individual in B{x,r), independently, dies with probability u, otherwise it is 
unaffected by the reproduction event. 

3. Throw down offspring in the ball, with the same type as the selected parent 
(who may now be dead), according to an independent Poisson point process with 
intensity umLeb|g^^^-) where Leb denotes Lebesgue measure. 

Wc shall refer to these events as reproduction events, even though they are also used to 
model large-scale extinction-recolonisation events. Notice that recolonisation is mod- 
elled as being instantaneous even after a large scale extinction. 

Remark 2.1 For simplicity we have described only a special version of the model in 
which, even when the reproduction event affects a large region, recolonisation is through 
a single founder. This guarantees that if we look at the genealogy of a sample from this 
population, although we may see more than two lineages coalescing in a single event, we 
do not see simultaneous mergers. More generally it would be natural to take a random 
number of colonists and then, on passing to the limit, the corresponding model would 
yield a spatial E-coalescent. 

Any reproductive event has positive probability of leaving the corresponding region 
empty, but because the neighbourhoods determined by different reproduction events 
overlap, an empty region can subsequently become recolonised. Provided the measure 
/i(dr) decays sufficiently quickly as r — t- oo, Berestycki et al. (2009) show that there 
is a critical value of m above which the population, when started from a translation 
invariant initial condition, survives with probability one. The difficulty is that it is not 
easy to find an explicit expression for the distribution of the genealogical trees relating 
individuals in a sample from the population. Knowing that an ancestral lineage is in a 
given region of space gives us information about the rate at which that region was hit 
by reproduction events as we trace backwards in time. On the other hand, simulations 
reveal that this effect is rarely significant. Mathematically, we overcome this difficulty 
by considering a model in which the intensity m is infinite, but we preserve some of 
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the signature of a finite local population size by retaining the reproduction mechanism 
so that a non-trivial proportion of individuals in a neighbourhood are descended from 
a common ancestor. In particular, this will result in multiple coalescences of ancestral 
lineages. 

Now let us describe the model that arises from letting m — >■ oo. (That the prelimiting 
model really does converge to this limit will be proved elsewhere.) At each point x G M^, 
the model specifies a probability measure on type space which we shall write p{t,x, ■), 
or sometimes for brevity p^- The interpretation is that if we sample an individual from 
X, then its type will be determined by sampling from p^. The reproduction mechanism 
mirrors that for our discrete time model: 

Definition 2.2 (Spatial A-Fleming-Viot process) The spatial A-Fleming-Viot pro- 
cess, {p{t, X, •), X G M?, t > 0} specifics a probability measure on the type space [0, 1] for 
every t > and every x EM?. With the notation above, the dynamics of the process are 
as follows. At every point {t, x, r) of the Poisson point process H, we choose u G [0, 1] 
independently according to the measure Vr{du). We also select a point z at random from 
B{x,r) and a type k at random according to p{t—, z, •). For all y G B{x,r), 



Sites outside B{x, r) are not affected, that is p{t, y, •) = p{t—, y, •) for every y ^ B{x, r). 

Remark 2.3 There are many variants of this model, some of which are outlined in 
Etheridge (2008). The model presented here should be regarded as fitting into a general 
framework in which the key feature is that reproduction events are driven by a Poisson 
point process determining their times and spatial locations, rather than on individuals. 
Barton et al. (2009) investigate a version of the model in which, instead of replacing a 
portion u of the population in a disc at the time of a reproduction event, the proportion 
of individuals affected decays (in a Gaussian distribution) with the distance from the 
'centre ' x of the event. Whereas in the disc based approach in the prelimiting ( individual 
based) model we had to suppress reproduction events that affected empty regions, this is 
not necessary in the Gaussian model. Moreover, (in contrast to the disc model) in that 
setting the prelimiting model has the Poisson point process in with constant intensity 
m as a stationary distribution. Although the proofs would be rather involved, analogues 
of our results here should carry over to the Gaussian setting. 

Of course we must impose restrictions on the intensity measure if our process is to exist. 
To see what these should be, consider first the evolution of the probability measure 
p{t,x, •) defining the distribution of types at the point x. This measure experiences a 
jump of size y G A C (0, 1] at rate 



P{t, y, •) = (!- u)p{t-, y, •) + uSk- 




(0,oo) J A 
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By analogy with the A-Fleming-Viot process, we expect to require that 

A{du) = / u^r'^Ur{du)ii{dr) (4) 

i(0,oo) 

defines a finite measure on [0, 1]. In fact, in the spatial setting we require a bit more. 
To see why, suppose that ^ is a bounded measurable function on [0, 1] and consider 
the form that the infinitesimal generator of the process must take on test functions 
of the form {p(x,dk),ip{k)) (with angle brackets denoting integration). Denoting the 
generator, if it exists, by G we shall have 

Jr2 J(0,oo) J[0,1] J[o,i] T^r^ 

p{y, dk)vr{du)p{dr)dy 

f LjJx y) 

/ — — h— u[{p{y, •), ^) - {p{x, Vr{du)p{dr)dy, 

where Lr{x, y) denotes the volume of the set B{x, r) n B{y, r). Notice in particular that 
Lr{x,y) < '^r'^'i-{\x-y\<2r}- the non-spatial case, this term vanishes (set y = x), but 
here if we want the generator to be well-defined on these test functions we make the 
stronger 

Assumption 2.4 

A(du)=/ (5) 



(0,oo) 



defines a finite measure on [0, 1]. 



Condition ([5]) controls the jumps of p at a single point. Since we are going to follow 
Evans (1997) in constructing our process via the dual process of coalescing lineages 
ancestral to a sample from the population, we should check that such a process is 
well-defined. First we define the coalescent process more carefully. 

In order to make sense of the genealogy of a sample at any time, we extend the 
Poisson point process 11 of reproduction events to the whole time line (— oo,-|-oo). We 
need some notation for (labelled) partitions. 

Notation 2.5 (Notation for partitions) 1. For each integer n > 1, letVn denote 
the set of partitions of {1, . . . and define a labelled partition of {1, . . . 
with labels from a set E, to be a set of the form {(vri,^,^), . . . , (vr/c, x,rj.)}; where 
{vTi, . . . , TTfc} G Vn and {x-,^^, . . . ,x-n^,) G . Let be the set of all labelled 
partitions of {1, . . . ,n} . 

2. For each n G N, let pn denote the partition o/{l, . . . ,n} into singletons. Moreover, 
if E is the space of labels and x = (xi, . . . , a;„) G i?", let Pn(x) denote the element 
{({l},xi),...,(M,x„)} ofVi. 
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3. If TT ^ for some n G N, then bl(7r) will refer to the unlahelled partition of 
{1, . . . ,n} induced by vr and if a ^ bl(7r), Xa will be our notation for the label of a. 

Our genealogical process will be a labelled partition. As in classical representations of 
genealogical processes, a block of the partition at genealogical time t > contains the 
indices of the initial lineages which share a common ancestor t units of time in the past, 
and its label gives the current location of this ancestor in E = M?. 

From the description of the forwards-in-time dynamics, the evolution of a sample 
of ancestral lineages represented by a labelled partition should be the following. We 
start with a finite collection of lineages at time 0. At each point (— t, x,r) G 11 (with 
t > here, since genealogical time points towards the past), given that u G [0, 1] is 
the result of the sampling according to each lineage present in the ball B{x,r), 
independently, is affected (resp., is not affected) with probability u (resp., 1 — u). A 
site y is chosen uniformly in B(x,r), and the blocks of all affected lineages merge into 
a single block labelled by y. The other blocks and their labels are not modified. We 
write {A{t), t > 0} for the Markov process of coalescing lineages described in this 
way. Its state space is Un>i^n- Note that A is constructed on the same probability 
space as that of the Poisson point process of reproduction events. Writing P for the 
probability measure on that space, we abuse notation slightly by writing to indicate 
that ^(0) = A, P^-a.s. Now let us verify that our Condition ([5]) is sufficient to ensure 
that the process {A{t),t > 0} is well-defined. Since two lineages currently at separation 
y G will coalesce if they are both involved in a replacement event, which happens at 
instantaneous rate 

/ Lr{y,0) I / u^uridu)] fi{dr), (6) 

Condition ([5]) is more than enough to bound the rate of coalescence of ancestral lineages. 
To guarantee that we can fit together the measures p at different points in a consistent 
way, we also need to be able to control the spatial motion of ancestral lineages. Consider 
the (backwards in time) dynamics of a single ancestral lineage. It evolves in a series of 
jumps with intensity 

dt f f ui>r{du)p,{dr)dx (7) 

J{\x\/2,oo) J[0,1] 

on M_|_ X R^. If we want this to give a well-defined Levy process, then we require 

I {lA\x\'^)\ [ I :^^%^UK,((iu)/i((ir) I < oo. (8) 

JM? \J(|x|/2,oo) J[0,1] / 

But Condition ([5]) certainly guarantees this. In fact it ensures that the rate of jumps 
of each ancestral lineage is finite. In other words, ancestral lineages follow compound 
Poisson processes. 



11 



Remark 2.6 At first sight it is disappointing that we have to take Condition and 
hence obtain a system of coalescing compound Poisson processes rather than more gen- 
eral symmetric Levy processes that ^ and ^ would allow. However, biologically there 
is not much loss. The 'gap' between Condition ^ and the weaker Condition ^ is 
that the latter would allow one to include very large numbers of extremely small jumps 
(in which only a tiny proportion of the population is affected) as the radius of the area 
affected by a reproduction event tends to zero. But in our population model, for small r 
we expect that a large proportion of the population in the neighbourhood be replaced. 

Remark 2.7 Notice that the locations of ancestral lineages are not independent of one 
another. Knowing that one lineage has jumped tells us that a reproduction event has 
taken place that could have affected other lineages ancestral to our sample. Wilkins 
& Wakeley (2002) consider a somewhat analogous model in which a linear population 
evolves in discrete generations (see Wilkins 2004 foT a two-dimensional analogue). Each 
individual in the parental generation scatters an infinite pool of gametes in a Caussian 
distribution about themselves, and the next generation is formed by sampling from the 
pool of gametes at each point. Individuals are assumed to have a finite linear width to 
avoid the pathologies that arise when common ancestry in a continuum model requires 
two ancestral lineages to have a physical separation of zero. They observe that "condi- 
tional on not coalescing in the previous generation, two lineages are slightly more likely 
to be further apart than closer together". In their setting a change of coordinates set- 
tles the problem: the distance apart and the average position of two lineages do evolve 
independently. For us the dependencies between lineages are more complex because the 
presence of a jump contains the information that a reproduction event has taken place, 
whereas the conditioning obviously tells us nothing about the timing of events in the 
discrete generation model. 

3 The genealogy of points sampled uniformly from a large 
torus 

We now turn our attention to populations evolving on a two-dimensional torus of side- 
length L. Our goal is to describe the genealogy of a finite number of individuals sampled 
uniformly at random from the torus and subject to events of very different scales, as 
L — )• CO 

To this end, we now consider a family of models indexed by N. For each L G N, 
we consider a population evolving on the torus T(L) C M? of sidelength L. We identify 
T(L) with the subset [— L/2,L/2]^ of and use the Euclidean norm | • | induced on 
T(L) by this identification. Although Bf(^i-^[x,r) will be our notation for the ball in 
T(L) centred in x and with radius r, we shall omit the subscript when there is no risk 
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of confusion. 

The population will be subject to two different classes of events that we call small and 
large. The region affected by each small event will be uniformly bounded (independently 
of the size of the torus). Large events will affect regions whose diameter is on the order 
of ipL which will be taken to grow with L, but they will be less frequent. We shall 
assume that the rate at which a given ancestral lineage is affected by a large event is 
proportional to l/pi with also chosen to grow with L. 

Now let us make the model more precise. Let ('i/'L)L>i be an increasing sequence 
such that there exists a G (0, 1] satisfying 

lim — = a, (9) 

L^oo logL ^ ^ 

and assume that |alogL — logV'il = o{{log L)^^/"^) as L — )• oo. 

Remark 3.1 The latter assumption is not necessary since all our results would still 
hold with each occurrence of (1 — a)logL replaced by log{L'ip2^) (see the end of the 
proof of Proposition \6.^} . but it is weak and considerably simplifies the presentation. 



Let (pl)l>i be an increasing sequence with values in (0, +oo], tending to infinity as 
L — )• oo. Finally, let p^{dr) and p^{dr) be two u-finite Borel measures on (0,oo), 
independent of L, such that there exist some positive constants W and satisfying 

inf {ii : ^''((i?,oo)) = 0} = i?" < cx) and 'm.i [R : p^ {{R, oo)) = = < oo . 

(For convenience, we ask that R^ < l/\/2 if a = 1.) To every r > 0, we associate two 
probability measures v^{du) and v!^{du) on [0,1], and we assume that for ★ G {B,s} 
and for each e G (0, i?*), 

/x^({r G [i?'^-e,i?1 :K;r/5o}) >0. (10) 

If Condition (|10p does not hold, we decrease the corresponding radius R* since otherwise 
the largest events never affect a lineage. 

Let us suppose that for each L>1, the reproduction events of the forwards in time 
model can be of two types : 

• Small events, given by a Poisson point process on M x T(L) x (0, oo) with 
intensity measure dt® dx ® fj,^{dr). If (t, x, r) is a point of n|^, then the centre of 
the reproduction event is x, its radius is r and the fraction of individuals replaced 
during the event is chosen according to I'j.. 

• Large events, given by a Poisson point process n£ on M x T(L) x (0,oo), inde- 
pendent of n£ and with intensity measure {pLil^i)^^dt dx p,^{dr). If {t,x,r) 
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is a point of n£, then the centre of the reproduction event is x, its radius is ^/^^r 
and the fraction of individuals replaced during the event is chosen according to 



Notice that we allow pi^ to be infinite, in which case large events do not occur. Since 
nf^ and nf are independent, the reproduction events could be formulated in terms of a 
single Poisson point process to fit into the Definition 12.21 of the spatial A-Fleming-Viot 
process. However, our aim here is to disentangle the effects of events of different scales, 
hence our decomposition into two point processes. 

Remark 3.2 Observe that, although the intensity of Hf^ is proportional to (p^.V'i)^^! 
the rate at which a lineage is affected by (that is, jumps because of) a large event is of 
order 0{p]^^). Indeed, the volume of possible centres for such an event is proportional 
to Tpf^, so that the jump rate of a lineage due to the large events is given by 

iriipLrfu vf{du)p^{dr) = — r'^u {du) p^ {dr) . 



In order for the genealogical processes, which we now denote by to emphasize 
dependence on L, to be well-defined for every L G N, we assume that Condition ([5]) is 
fulfilled. In this setting, the condition can be written 

r'^u v^{du)iM'{dr) H / / r^n v^{du)p^{dr) < oo. 

PL Jo Jo 

Let us introduce some more notation. We write 

L 



r(L,l) = |xGT(L):N>^^^^ 

and for each integer n > 2, 

r(L,n) = {{xi,...,x„}ET(Lf :|xi-Xj|>^ foralH/i}, 

r^(L,n) = |{(ai,XaJ,...,(afe,XaJ} GP^ : G r(L,A;)|, 

where as before denotes the labelled partitions of { 1 , . . . , n} . When we require an 
element A of T_^{L,n) to have exactly n blocks, we shall write A G T_^{L,n)* . 

In order to obtain a non-trivial limit, we rescale time for the process A^ by a factor 
that we denote vul. Recall that if A for some n G N, hl{A) stands for the unlabelled 
partition of {1, . . . ,n} induced by A. For each L G N, let us define the (non-Markov) 
process A^'"^ by 

A^'^{t) =hl{A^{wLt)), t>0. 

Note that for each L G N, if we start A^ from Al, a labelled partition of {1, . . . ,n} 
with labels from T(L), then A^'"^ takes its values in the Skorohod space D-p^[0,oo) of 
all cadlag paths with values in Vn (the set of partitions of {!,... ,n}), P^^-a.s. 
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Recall the definition of a given in ([9]). In the absence of large events, our model is 
similar in many respects to the two-dimensional stepping stone model and so it comes as 
no surprise that just as for the stepping stone model, the genealogy of a random sample 
from the torus should converge (on a suitable timescale) to a Kingman coalescent as 
the size of the torus tends to infinity (see in particular Cox & Griffeath 1986,1990, Cox 
& Durrett 2002 and Zahle et al. 2005 for precise statements of this result in different 
contexts). Our first result says that if a < 1, then we still obtain a Kingman coalescent, 
but the timescale will be influenced by the large events: the latter reduce the effective 
population size. 

Before stating the result formally, let us try to understand why we should expect 
something like this to be true. To understand the appropriate timescale we just need 
to consider two lineages. The time they need to coalesce will be decomposed into two 
phases. If pL is not too big, the first phase will be the time until they first come 
within distance 2i?^^/^i and the second will be the additional time required for them to 
coalesce. During the first phase they evolve according to independent compound Poisson 
processes. If is small enough, the coalescence event that will eventually occur during 
the second phase will, with probability close to one, be triggered by a large event. For 
larger values of pi, large events will not be frequent enough to hit the two lineages when 
they are at a distance that would allow them to coalesce (i.e., less than 2R^iPl), and 
coalescence will instead be caused by a small-scale event. The first phase is then taken 
to be the time until the lineages first come within distance 2R^ of one another. The fact 
that with high probability they will not be hit by the same large-scale event means that 
once again they evolve (almost) independently of one another during this first phase. 
The second phase is now the time taken for them to coalesce due to a small event. The 
transition between these two regimes is when pi oc V'i log L. Now suppose that we start 
from a sample in T{L,n). The first phase is then long enough that, when it ends, the 
spatial location of lineages is no longer correlated with their starting points. Finally, 
why do large-scale events not lead to multiple mergers? The key point is that, when a 
pair of lineages ancestral to our sample first comes within 2R^^pL of one another, all 
other pairs are still well-separated. So if pL is not too big, this pair will coalesce before 
a third lineage can come close enough to be affected by a common event. If we take 
larger p^, the reason is exactly the same but now lineages have to come within distance 
2R^ and coalescence is driven by small events. 

Here then is the formal result which makes explicit the convergence in distribution 
of our spatial genealogies to a nonspatial coalescent process. In the following, (resp., 
(^b'^lPl^) variance of the displacement of a lineage during one unit of time due 

to small (resp., large) events, see (pO|) below. 

Theorem 3.3 Let /C denote Kingman's coalescent, and recall that for each n G N, pn 
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denotes the partition of {1, . . . ,n} into singletons. In the notation of suppose a < 1 
(and [TOjj holds). Then, for each integer n > 2 and any sequence (^L)LgN such that 
Al G r^(L,n)* for every L, 



where 



oo, 



(i^gg^ if ,Z Vi ^ 6 € [0, oo) and ^ ^ oo, 

^ if {Pl^'4^l)l>i is bounded or '^"^^ — )• 0. 



-fTere £p(X) denotes the law under the probability measure P of the random variable X 
and =^ refers to weak convergence of probability measures. 

For a = 1, things are more complicated. When ijjL is commensurate with L, large scale 
events cover a non-negligible fraction of the torus. If they happen too quickly, then they 
will be able to capture multiple lineages while the locations of those lineages are still 
correlated with their starting points. For intermediate ranges of pi, lineages will have 
homogenised their positions on T(L) through small events, but not coalesced, before 
the first large event occurs and we can expect a A-coalescent limit. If the large events 
are too rare, then coalescence will be through small events and we shall recover the 
Kingman coalescent again. 

To give a precise result we need to define the limiting objects that arise. In the case 
a = 1, for each L G N, we set 

{PL if pl/ {L'^^ogL) has a finite limit, 

^ iipU{LHogL)^+^, 

and define A^'"^ as before. Since we shall need to keep track of the labels (spatial 
positions) of the ancestral lineages in some cases, it will also be convenient to introduce 
the following rescaling of A^ , evolving on T(l) for all L G N: 

A^{t) = J A^iwLt), t > 0, 

where by this notation we mean that the labels are rescaled by a factor L^^. Similarly, 
for X G T(l)" we write Lx for (Lxi, . . . , Lxn) G T(L)". Finally, let us introduce the 
processes which will appear as the limits of our rescaled genealogical processes. 

Definition 3.4 Let b G [0,oo) and c > 0. We call A°°'''''^ the Markov process with 
values in UnGN^n (with labels in T(l)j such that 

1. The labels of the lineages perform independent Brownian motions on T(l) at speed 
bal (ifb = 0, the labels are constant), until the first large event occurs. 
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2. Large events are generated by a Poisson point process n on M X T(l) X (0,l/\/2] 
with intensity measure c~'^dt® dx ® ^^{dr). At a point {t,x,r) ofH^, a number 
u G [0, 1] is sampled from the probability measure , and each lineage whose label 
belongs to iJ-jp^^-j (x, cr) is affected (resp., is not affected) by the event with proba- 
bility u (resp., 1 — u), independently of each other. A label z is chosen uniformly 
at random in Bj(^i){x,cr), and all the lineages affected merge into one block which 
adopts the label z. The other lineages (blocks and labels) remain unchanged. 

3. The evolution of the labels starts again in the same manner. 

Remark 3.5 Notice that this process looks like another spatial K-coalescent, except 
that now ancestral lineages perform independent spatial motions in between coalescence 
events. This process is dual (in the obvious way) to a spatial A-Fleming-Viot process in 
which, during their lifetimes, individuals move around in space according to independent 
Brownian motions. 

For each r £ [0, l/^/2], let Vr denote the volume of the ball 5^(1) (0, r). 

Definition 3.6 Let (3 G [0,oo) and c > 0. We use K^l^'^^ to denote the K-coalescent, 
defined on UneN^"' which if there are currently m ancestral blocks, then each tran- 
sition involving k of them merging into one happens at rate 

^ =c-' / {V,ru)''{l - V,rny^->'u^{du)f,^{dr) + /3 6{k=2}- 

Jo Jo 

Recall the notation p„ and pn(x) introduced in Notation 12.51 and Cp{X) and =^ intro- 
duced in the statement of Theorem 13.31 We can now state the result for a = 1. 

Theorem 3.7 Suppose there exists c > such that for every L E N, Vl = cL. Let 

n G N, X G ¥(!)"■ such that Xi ^ Xj whenever i ^ j, and let {AL)Lef>i be such that for 
every L, Al G r^(L,n)*. Then, as L ^ oo, 

(a) //plL-2^6g [0,oo), 

(b) If PlL~^ oo, ^"^gi — 7- /3 G [0, cxd) and if the total rate of occurrence of large 
events is finite (i.e., fi^ has finite total mass). 
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Notice that the case (a) differs from all other cases in that the influence of space does 
not disappear as L — )• oo and the evolution of the limiting genealogy still depends on 
the precise locations of the lineages. 

The intuition behind Theorem 13.71 is as follows. If ipL on L large events cover a non- 
negligible fraction of the torus, and so only a few large events are sufficient to gather 
two lineages at a distance at which they can coalesce. However, a local central limit 
theorem will give us that on a timescale of order at most 0{L?'), a lineage subject to 
only small events behaves approximately like Brownian motion, whereas after a time 
tL ^ L"^, its distribution is nearly uniform on T(L) (for L large enough, see Lemma 
15. 4|) . Since the mean time before a large event affects a lineage is of order 0{pl), 
the limiting genealogical process (when we include both large and small reproduction 
events) will depend on how scales with L^. If /o^ is of order at most 0{L'^), then space 
matters and the process rescaled to evolve on T(l) on the timescale pL converges to 
a system of coalescing Brownian motions, whereas if pL L'^, the homogenisation of 
the labels/locations of the lineages before the occurrence of the first large event which 
affects them leads to a limiting unlabelled genealogical process given by an exchangeable 
coalescent with multiple mergers. 

Remark 3.8 It is somehow disappointing that we must impose a finite rate of large 
events to obtain the convergence of Theorem \3. T^ b). Indeed, it seems that case (a) 
should give us the right picture: in the limit, in between large events lineages perform 
Brownian motions on the torus of sidelength 1 due to small events, except that now 
the time required for at least one lineage to be affected by a large event is so long that 
lineages exhaust space and their locations become uniformly distributed over the torus 
before they are taken by a coalescence event. However, when p^ has infinite mass, 
lineages are infinitely often in the (geographical) range of a large reproduction event 
over any interval of time, and we need good control of their complete paths to actually 
be able to say something about the epoch and outcome of the first potential coalescence 
event. Now, observe that Equation ^54\ ) can only be generalized to the finite- dimensional 
distributions of these paths, and does not guarantee that a large event cannot capture 
some of the lineages at a time when they are not uniformly distributed over T(l). 

Theorem 13.71 deals with the case where ipL is proportional to L. Let us now comment 
on the remaining cases, in which a = 1 but ipi <^ L. First, it is easy to see that the 
convergence in (c) still holds, since it is based on the fact that large events are so rare 
that none of them occurs before small events reduce the genealogical process to a single 
lineage. 

Second, since the total rate of large events on the timescale pi is /i'^(M+)L^/V'|,, it 
cannot be bounded unless p^ = (a situation we excluded in (fTOj) ). On the other hand. 
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for the reason expounded in Remark 13.81 we are unable to derive a limiting behaviour 
for the genealogy when large events can accumulate, and so the result of Theorem 13. 7( 6) 
has no counterpart when ipi <^ L. 

Third, as explained above, when p_L < any limiting process will necessarily have 
a spatial component. Now, because we start with lineages at distance 0{L) of each 
other, we need to rescale space by L in order to obtain a non trivial initial condition. 
The last parameter we need is the timescale wl on which to consider the genealogical 
process. But a separation of timescales will not occur here, and so the computations 
done in Section [5] will show that the suitable choice of zul depends on the precise 
behaviour of Pl/L"^ and pL/ipj^. Several limiting processes are thus possible, and since 
all the arguments needed to derive these limits are scattered in Sections [S] and [71 we 
chose not to detail them here. 

4 Existence and uniqueness of the forwards- in-time pro- 
cess 

Our spatial A-Fleming-Viot process associates a probability measure on type space 
to each point in M?. In other words, it takes its values among functions from to 
A^i([0, 1]). Evans (1997) uses duality with a system of coalescing Borel right processes 
on a Lusin space E to construct a family of Markov processes with values in the set 
of functions from E to A^i({0, 1}^) (or equivalently, to A^i([0, 1])). He also obtains 
uniqueness in distribution of the process. In his setting, coalescing particles evolve 
independently until they meet, at which point they instantly coalesce. In our case, the 
particles in the candidate dual do not move independently and nor do two particles 
hit by the same reproduction event necessarily coalesce, but nonetheless the key ideas 
from his construction remain valid. Note that, although we present the result in two 
dimensions, the proof carries over to other dimensions. 

First we give a formal description of the coalescing dual and then we use the Evans' 
construction to give existence and uniqueness in law of a process p which assigns a 
probability measure on [0, 1] to each point in M^. We then identify p as the spatial 
A-Fleming-Viot process in which we are interested. 

4.1 State-space of the process and construction via duality 

We shall only present the main steps of the construction, and refer to Evans (1997) for 
more details. 

Let us define H as the space of all Lebesgue-measurable maps p : — A^i([0, 1]). 
Two elements pi and p2 of H are said to be equivalent if Leb({2; G : pi{x) ^ 
P2ix)}) = 0. Let H be the quotient space of H by this equivalence relation. If E is 
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a compact space, let us write C{E) for the Banach space of all continuous functions 
on E, equipped with the supremum norm || • ||oo- For each n G N, let L^(C([0, 1]**)) 
be the Banach space of all Lebesgue-measurable maps $ : (R^)" — )• C([0, 1]") such 
that /(]{j2)n ||*^*(2;)||oo dx < oo. A remark in Section 3 of Evans (1997) tells us that 
the separability of L^(C([0, 1])) and a functional duality argument guarantee that H, 
equipped with the relative weak* topology, is a (compact) metrisable space. Finally, if 
A is a measure on a space E' , let us write L^{X) for the set of all measurable functions 
f : E' such that /g, \f{e)\X{de) < oo. 

Let n E N. Given $ G L^C{[0, 1]")), let us define a function /„(• ; $) E C(S) by 

Inip;^)= / ( (R) p{Xi),^{xi, . . . ,Xn)) dxi . . .dXn, 

where as before the notation (z^, /) stands for the integral of the function / against the 
measure v. We have the following lemma, whose proof is essentially that of Lemma 3.1 
in Evans (1997). 

Lemma 4.1 The linear subspace spanned by the constant functions and functions of 
the form In{- ;«>), with ^ = {W^^^Xi) , e L^{dx®'^)r\C{{R^Y) and Xi G C([0,1]) 
for all 1 < i < n is dense in C(S). 

We need a last definition before stating the existence and uniqueness result. Let 
n G N. For any p G H, vr G such that bl(7r) = {ai,...,afc}, and any bounded 
measurable function F : [0, 1]" — t- M, we set 

Tnip;7^;F)= / F(u„-i(i), . . . , t;„-i(„))p(XaJ(dfaJ . . . p(XaJ(dfaJ, 
J[0,1]'= 

where a~^{i) is the (unique) block aj which contains i and Vaj is the variable used for 
the measure p{xaj). In words, we assign the same variable to all coordinates which 
belong to the same block in the partition vr. (Recall that Xa is our notation for the label 
of block a.) Recall also the notation pn(x) and A introduced in Notation 12.51 and the 
following paragraph. 

Theorem 4.2 There exists a unique, Feller, Markov semigroup {Qt,t > 0} on H such 
that for allnGN and $ G L^(C([0, l]")), we have 

[ Qt{p,dp')In{p;<^)= [ E^^(^^)[Tn{p;Aity,<^{xu...,Xn))]dXi...dXn. (11) 



Consequently, there exists a Hunt process {p{t),t > 0} with state-space E and transition 
semigroup {Qt,t > 0}. 

Before proving Theorem 14.21 Ist us make two comments on this result. First, since 
the H-valued process we obtain is a Hunt process it is cadlag and quasi- left continuous. 
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that is, it is almost surely left-continuous at any previsible stopping time (see e.g. 
Rogers & Williams 1987 for a definition of quasi- left continuous filtrations) . However, 
more precise statements on its space-time regularity seem to be a delicate question, 
which will require a thorough investigation. 

Second, as in Kimura's stepping stone model introduced in ([1]), the duality relation 
(jlip can be interpreted in terms of genealogies of a sample of individuals. Indeed, recall 
the stepping stone model is dual to the system i £ I})t>o of particles migrating 

from deme i to deme j at rate mji and coalescing in pairs at rate l/N^ when in the 
same deme: for any t > 0, we have 



E 



These equations show that a function (here the nj(0)-th moments) of the frequencies at 
different sites of and at (forward) time t can be expressed in terms of the genealogy 
of a sample made of nj(0) individuals in deme i for every i €z I, and run for a (backward) 
time t: all lineages having coalesced by time t necessarily carry the same type, whose law 
is given by the type distribution at the site where their ancestor lies at backward time t 
(or forward time 0). Equation (|lip can be interpreted in exactly the same manner, but 
holds for a much wider collection of functions of p and A. 

Proof of Theorem \4.2\ - The observation that the construction of Evans (1997) can also 
be justified in our setting follows from Remark (a) at the end of his Section 4. 

Existence and uniqueness of A are easy from Assumptions ([6]) and ([8]). Next, we 
must verify consistency of A in the sense of his Lemma 2.1. In fact, this is the 'sampling 
consistency' described in the introduction and was a primary consideration in writing 
down our model. It follows since the movement of the labels of a collection of blocks 
does not depend on the blocks themselves and from the fact that a coalescence event 
of the form {({1}, xi), ({2}, X2)} —5- {({l,2},x)} for a pair of particles corresponds to a 
jump {({l},xi)} — )■ {({l},a;)} onto the same site x G if we restrict our attention to 
the first particle. 

The next property needed in the construction is that provided it is true at t = 0, 
for every t > the distribution of the labels in A{t) has a Radon-Nikodym derivative 
with respect to Lebesgue measure, and furthermore an analogue of Evans' Equation (4.2) 
holds. In the setting of Evans (1997), the first requirement stems from the independence 
of the spatial motions followed by different labels and the corresponding result for a 
single label. Here, since the motion of all lineages is driven by the same Poisson process 
of events, their movements are correlated. However, the desired property is still satisfied. 
To see this, note that each jump experienced by a lineage in the interval [—t, 0] takes it to 
a position that is uniformly distributed over the open ball affected by the corresponding 
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reproduction event. Thus, if A{t) has k blocks and D C (M^)'^ has zero Lebesgue 
measure, the probabihty that the labels of the blocks of A{t) belong to D is equal to 0. 
Equation (4.2) of Evans (1997) then still holds, without Evans' additional assumption of 
the existence of a dual process for the motion of one lineage (which anyway is satisfied 
since our lineages perform symmetric Levy processes). 

The last step is to check the strong continuity of the semigroup {Qt,t > 0}, but 
this readily follows from the relation (jlip and the Feller property of A (which is itself 
evident since jumps do not accumulate in our dual process). 

The desired conclusion now follows from Theorem 4.1 in Evans (1997). □ 

4.2 Identification of the process 

We can use (|lip to derive an expression for the infinitesimal generator of {p{t),t > 0} 
acting on the functions /„(• ; $) considered in Lemma |4. II This lemma and the unique- 
ness result stated in Theorem 14.21 guarantee that it will be sufficient to characterize the 
process p and to show that it corresponds to the evolution we described in Section [2] in 
terms of a Poisson point process of reproduction events. 

Let n G N and ^> G C{E) be such that ^) = (g) (HiLi Xi), where ip G L^((ix®") n 
C{{M?)"') and Xi ^ ^"([0, 1]) for all 1 < i < n. Writing G for the generator of the process 
p and Qn for the generator of the coalescing Levy processes A acting on functions of 
V^, we obtain from pT|) that 

Ep[iMt),<^)]-Up,^) 



G/„(/>;f) 



lim ■ 



lim^ I ij{xi, . . . ,Xn)\Ep„(^)hn(p;A{t);f\ 

n 

Ipixi, ... ,Xn) Qn ]T\ X 

\n L \ 



1=1 



Xi),Xil 



dx 



(p„(x)) dx 



(12) 



Note that the quantity on the right-hand side of (jl2p is well-defined (and the interchange 
of limit and integral is valid) since belongs to L}{dx®"') and the rate at which at least 
one oi k < n blocks is affected by a reproduction event is bounded by n times the 
integral in ([S]), so that ^ is a jump- hold process and its generator satisfies 



n 

i=l 



< 2Cn 



^n[p 



4 = 1 



< 



2C7n]^ 



Xi 



< oo 



4 = 1 



for a given constant C < oo. 

Using the description of the evolution of A in terms of events in H, the right-hand 
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side of ([T2]) is equal to 



dx^^-ipixi, . . . ,Xn) [ dy [ ii{dr) [ Vr{du) [ 

JlR2 Jo Jo JB{y,r) ^Tr^ 

I(Z{l,...,n} '-ie/ i'0 



, (13) 



where | • | stands for cardinality. Indeed, given xi, . . . ,x„ in ()13p . only one term in the 
sum over / C {1, . . . ,n} is non-zero. For this particular term, each of the \I\ blocks 
whose labels lie in B{y,r) belong to the set J of the blocks affected by the event with 
probability u (independently of one another), and the affected blocks adopt the label z. 
After some algebra and several uses of Fubini's theorem, we obtain that p3p is equal 
to 

r dz 

'B(y,r) 7rr2 Jq 

X Yl Tl{'^B{y,rr{Xj){px,,Xj)}Yl'i-B{y,r){Xi) 
IC{l,...,n}j0 



dy ji (dr) / i>r{du) 
Jo Jo 



Pz{dk) i dxi . . . dxn i>{xi 



, . . . , j 



(14) 

which is precisely the generator of the forwards in time process of Section [2j Using 
Theorem 14.21 we arrive at the following result. 

Proposition 4.3 The martingale problem associated to the operator G defined by ( [j^p 
on functions of the form given in Lemma \4-l\ is well-posed. Furthermore, the spatial 
A-Fleming- Viot process p of Theorem \4-^ is the solution to it. 



5 Some estimates for symmetric Levy processes 

In this section, we gather some results on symmetric Levy processes that we shall need 
to call upon in our proofs of Theorem 13.31 and Theorem 13. 7i For the sake of clarity, the 
proofs of the three lemmas are given in Appendix Rl 

First, we introduce some notation that we shall use repeatedly. 

Notation 5.1 1. In the following, we shall suppose that all the random objects con- 
sidered are constructed on the same probability space (ri,J^, P), and if X is a 
process defined on Q with state-space E and x £ E, we shall write Fx for the 
probability measure on Q under which X{0) = x a.s. 
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2. For a stochastic process {Xt}t>Q evolving in 'ir(L), we shall write T{R,X) for the 
first entrance time of X into Bj^i}{0, R). When there is no ambiguity, we write 
simply T{R). 

Let {i^)L>i be a sequence of Levy processes such tliat for eacli -L G N, £^ evolves 
on tlie torus T(L) and — £^{0) lias a covariance matrix of the form o"£ld. Assume 
that the following conditions hold. 

Assumption 5.2 (i) There exists > such that cr| — t- o"^ as L ^ oo. 

(ii) Eq [1^^(1)1''] is bounded uniformly in L. 

Our first lemma describes the time £^ needs to reach a ball of radius dL L around 
0, when it starts at distance 0{L) of the origin (recall the definition of T{L, 1) given in 
Section [3|) . 

Lemma 5.3 Let {dL)L>i be such that liminfL_j.oo di > and '"^^J^^-* — ;> 7 G [0, 1) as 
L — )• 00. Then, 



lim sup sup 



■ XL 



0. 



(15) 



The proof of Lemma [5.31 follows that of Theorem 2 in Cox & Durrett (2002). In partic- 
ular, we shall use the following local central limit theorem (which is the counterpart in 
our setting of Lemma 3.1 in Cox & Durrett 2002). Let [z\ denote the integer part of 
zeR, and write p^{x,t) for F^[l^{t) G 5(0, dz,)]. 

Lemma 5.4 (a) Let el = (logL)^^/^. There exists a constant Ci < 00 such that for 
every L > 2, 



sup sup 

(b) If vl ^ 00 as L ^ 00, then 



4 



P^{x,t) < Ci 



(16) 



lim sup sup 



p^{x,t) 



.dl 



L2 



0. 



(17) 



(c) // — )• 00 as L —)• 00 and I{dL,x) = 1 + (|xp V d^), then 



lim sup sup 

L^oo a.g']r(L) ULl(dL,x)<t<eLL2 



—^p^{x,t)-l 



0. 



(d) There exists a constant C2 < 00 such that for every L > 1, 



sup sup ( 1 + -35- ) p^(x,t) < C2. 
t>0 zeT(L) V "l 



(19) 
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In essence, Lemma 15.41 savs that on the timescale d| <^ t <^ , the Levy process 
behaves hke two-dimensional Brownian motion, whereas at any given time t ^ L? , its 
location is roughly uniformly distributed over T(L). 

Another consequence of Lemma [5.41 is the following result, which bounds the proba- 
bility that hits a ball of bounded radius during a 'short' interval of time in the regime 
t » L\ 

Lemma 5.5 Fix R > 0. Let {Ul)l>i Oind {ul)l>i be two sequences increasing to 
infinity such that UlL""^ — )• oo as L oo and 2ul < L^(log L)^-*^/^ for every L > 1. 
Then, there exist C > and Lq £ N such that for every sequence {U'j^)l>i satisfying 
U'j^ > Ul for each L, every L > Lq and all x £ T(L), 

CUL 



6 Proof of Theorem 13.3 



< 



L2 



Armed with the estimates of Section [SJ we can now turn to the proofs of our main 
results. 

Notation 6.1 For each L > I, let > 0} be the Levy process on T{L) whose 

distribution is the same as that of the motion of a single lineage subject to the large and 
small reproduction events generated by and n£. 

In the rest of this section, we assume that the assumptions of Theorem l3.3l are satisfied. 
6.1 Coalescence time for two lineages 

We begin by studying the genealogical process of a pair of lineages starting at distance 
0{L) from each other. Since the motions and of the lineages are distributed like 
two independent copies of the process until the random time Tl at which they come 
at distance less than 2i?^^i, the difference 

X^{t)^i{{t)-i^{t), 0<t<TL 

has the same distribution as {C^(2t), < t < ^ T{2R^ipL,i^)] ■ We shall use 
Lemma 15.31 to derive the limiting distribution of T^, but first we need to introduce 
the relevant variances. Consider a single lineage. Because it jumps at a finite rate ow- 
ing to small and large events, the following two quantities are well-defined and finite : 

(^1 = j X\dy,dz) and a% = j y^ x^idy,dz), (20) 

where stands for the intensity measure of the small jumps experienced by the lineage 
and for that of the large jumps renormalised by (the form of these two measures 
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is given in ([7])). We now have all the ingredients we need to describe the asymptotic 
'gathering time' of two lineages. 

Proposition 6.2 (a) If p2^%j)\ — t- oo as L ^ oo, then 



lim sup sup 

t>0 Ai,Gr^(L,2)* 



■ {l-a)pLL^ logL 
Jl > ; ^^—t; t 



(6) // Pi^ipl 6 G [0, oo) as L ^ oo, then 



lim sup sup 



Tl > 



2vra|,Vi 



:i-a)L2 logL 



0. 



t 



-t 



0. 



Proof of Proposition 1 6. Let us first recall two results on Poisson point processes, 
which are consequences of the exponential formula given, for instance, in Section 0.5 of 
Bertoin (1996). Following Bertoin's notation, let {e(t), t > 0} be a Poisson point process 
on M X with intensity measure n{dy) ® dt, where the Borel measure k satisfies 



|1 — e^\K{dy) < oo 



and /y™K(dy) = 0, mG{l,3}. 
Jr 



Under these conditions, we have for each fixed t > 

2" 



E 



E 



s<t 
s<t 



t / y nidy), 



■ 3t' 



y'n{dy) ] +t y^nidy) 



(21) 



(22) 



(23) 



These properties will be useful in computing the variances and fourth moments of the 
random variables considered below. 

Let us start with the proof of (a). Consider the process £^ defined by: for every 
t > 0, 

l\t) = ^e{2pLt). 

This process evolves on the torus of sidelength iIjJ^^L, and makes jumps of size 0{iIjJ^^) 
at a rate of order 0{pl), as well as jumps of size 0(1) at a rate of order 0(1). 

Let us check that £^ satisfies the assumptions of Lemma 15.31 To this end, we view 
starting at as the sum of its jumps and adapt the problem to use the results on 
Poisson point processes given above. First, let us define £^ as the Levy process on 
evolving like £^ (but without periodic conditions). For i G {0, 1} and each L > 1, t > 0, 
let £^'^{t) denote the i-th coordinate of £^{t). Note that the distance reached by £^ up 
to a given time t is less than or equal to the distance at which (.^ traveled up to t, and 
so we can write 

Eo[K^(l)|^] <Eo[|i^(l)|^] =Eo[{i^'i(l)2+i^'2(l)2}'' 

<2|Eo [i^'^' 



im +E( 
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By symmetry, we need only bound Eq [^^'^(1)'^] . Let us denote by oi, 02, .. . G [— 2i?'^/'i/'L, 
2R'/^Lf (resp., 61,62,. ■■ G [-2R^ ,2R^f) the sequence of the jumps of l^'^ before time 
1 due to small (resp., large) events. Using the convexity of y 1— )• y^, we have 



En 



< 8 E, 



(24) 



Applying (j23p to each term on the right-hand side of (j24p yields 
Eo[(i^'^(l))'*] <96^ ""'^^^fj / yV(dy,'i^) + 964 + 16 ^ yV(^i2/,d^), (25) 

which is bounded uniformly in L since Pbi^J^^ vanishes as L grows to infinity, and each 
integral is finite. Coming back to the original problem, we obtain that Assumption 15.21 
(ii) holds for the sequence of processes (^^)l>i- 

Concerning Assumption 15.21 (i), observe that a\ is simply the variance of £^'^(1). 
To obtain the asymptotic behaviour of a\, we show that up to time 1, £^ does not see 
that it is on a torus. Hence, with high probability £^'^(1)^ = ^^'^(1)^ and so 

Eo[£^'Hl)'] ^Eo[i^'Hl)2] =2^ j yWdy,dz)+2 j y\^ {dy , dz) = 2al + o{l) 



as L — )• 00, where the second equality uses (|22|) . To make the first equality rigorous, 
we apply Doob's maximal inequality to the submartingale |^^|^. This yields, with a 
constant C > which may change from line to line. 



sup |£^(.)| > 

0<s<l ^WL 



< 



^Eo [1^^(1)11 



But the calculation in (|25p shows that the latter expectation is finite, and so 

L 



sup \£'^{s)\> 

0<s<l 'JWL 



<-4- 



(26) 



On the event £l = ^ supQ<;g<]^ l^^(s)l ^ 3^}' paths of and can be coupled so 
that l^{s) = i^{s) for every s G [0, 1], and since these quantities are bounded for each 
L we can write 

Eo[(£^'Hl))2] =Eo[(i^'Hl))' Is,] +Eo[(^^'^(l))2 l£i] 

= Eo[(^"^'Hl))'] -Eo[(l^'ni))' l£i] +Eo[(^^'i(l))2 Isi]. (27) 

By (I26p and the fact that evolves on the torus of size LipJ^^ , the last term on the 
right-hand side of (p7|) is bounded by 

c£x^ = cS^O asL^oo. 



L2 
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For the second term on the right-hand side of (p7|) . let sl{1) = supo<3<i Using 
Fubini's theorem on the second line, we have 



Eo[(l^'Hl))' l£i] <IEo[sl(1) 



9V'' 



sl{1) > 



L 


dy 


L 1 




/•oo 

] 


3^ J 


-J 





> Vy] dy. 



(28) 



Now, by the argument leading to Po['5l(1) > y^] is bounded by Cy ^ for each 

y > 0, where C is a constant independent of y. Consequently, the right-hand side of 
(f28]) is bounded by 



dy 

2 



L2/{94,1) y 

Coming back to (j27|) . we can conclude that 

al = 2al + 0(1) 







as L 



as L 



oo. 



oo. 



If we now recall the equality in distribution described at the beginning of the section, 
we can use Lemma 15.31 applied to on the torus of size Lip^^ the entrance time 
into B{0,2R^) to write that 

^2- 



lim sup sup 



> — ^ t 



27rcr| 



0. 



(29) 



By the assumption on |a; log L— log('i/'L)| introduced just after Q and Lemma [53] applied 
to to bound the probability that lies between 



and 



(l-a)pLL^logL 



(a) of Proposition 16.21 follows from 

Let us now turn to the proof of (6). This time, we define £^ for every t > by 

Similar calculations give, as L — )• oo, 

Eo[\£''{l)\^] =2al + 2bal + o{l) if V! ^ & G [0, oo). 



and Eq [1^^(1)1^] is bounded uniformly in L. We can therefore apply Lemma 15.31 to £^ 
as above. □ 

Having established the time that it takes for two lineages starting from distance 
L apart to come close enough together that they have a chance to coalesce, we now 
calculate the additional time required for them to actually do so. We shall have to 
distinguish between several regimes, depending on whether large or small events prevail 
in the evolution of the pair of lineages. Our goal in the rest of this section is to prove 
the following result. 
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Theorem 6.3 For each L G N, lettL denote the coalescence time of the pair of lineages 
under consideration. Then, 
(a) If ^ ^ CO as L ^ oo, 



ib)If^ 



lim sup sup 

■C'^oo t>o yi£er^(L,2)* 

b G [0, oo) and 



Pa, 



tL > z T^—^ t 



0. 



PL 



lim sup sup 

-f'-s-oo t>o Ai,er^(L,2)* 



(c) If {j^)l>i is hounded or ^^j^ 



oo as L 



tL > 



as L 



-)• oc, 

;i-a)L^ logL 
27r(a2 + 6a|) 



/ , ^/)? log L 
OO (ana so 



PL 



lim sup sup 



Ai 



L^logL 



0. 



0. 



The cases (a) and (6) are separated only because the timescales of interest are not of the 
same order, but the reasons why they hold are identical: in both cases, large jumps are 
frequent enough that, once the lineages have been gathered at distance 2R^ipL, they 
coalesce in a time negligible compared to Tl- In contrast, in (c) we assume that the 
rate at which the lineages are affected by large events is so slow that we have to wait 
for the lineages to be gathered at distance less than 2R^ before they have a chance to 
coalesce (and they do so in a negligible time compared to log L) . If none of the above 
conditions hold, then the proof of (c) will show that, also in this case, the probability 
that a large event affects the lineages when they are at distance less than 2R^iI)l and 
before a time of order ©(L^logL) vanishes as L tends to infinity. However, we are no 
longer able to describe precisely the limiting behaviour of i^, see Remark 16.81 

Let us first make more precise the sense in which the additional time to coalescence 
is negligible once the lineages have been gathered at the right distance. 



Proposition 6.4 Let ($l)l>i a sequence tending to infinity as L 



oo. 



(«) Ifm 



L>1 



is such that 



PL 



i>'i log *i 



as L — 7- oo, we have 



hm sup Pa^ [tL > ^lPl] = 0, 



(30) 



where the supremum is taken over all samples = | ({!}, x{'), ({2}, ^2 )} such that 
\x{ - < 2R^ijL- 

(b) Under no additional condition, we have 



hm sup P^, [tL > <^l] = 0, 



(31) 



where the supremum is now taken over all samples A'^ = | ({1}, a;{'), ({2}, )} such 
that |xf - x^\ < 2R'. 
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Taking $l = p^fogL ^ Pl^^l^)^ ^he result in (a) shows that when ^ °^ > oo, the 

coalescence time of two lineages at distance at most 2R^ipL is indeed much smaller than 
Tl (which is of order logL x (l A pi'il^]^'^) by Proposition I6.2p . 

Proof of Proposition \6.4\ Recall that for each L E N, we defined as the difference 
between the locations of the lineages S,i and S,2 on the torus T(L). In the following, if 
both lineages are affected by the same event, we shall consider that hits but the 
number of lineages remains equal to 2, which means that they can separate again later 
(if the measures and are not all the point mass at 1). However, it is the first 
time at which such an event occurs which will be of interest, and we keep the notation 
tL to denote this time. As we already noticed, X^ behaves like {^^(2t), t > 0} outside 
i?(0, 2R^ij)L) , whereas inside the ball it can hit owing to reproduction events affecting 
both lineages and 

Case (a). For each L G N, set = Qq = and for every i > 1, 
Qf ^ inf [t > gf_i : X^(t) ^ b(o, ^^""V-l)} 

and 

qt ^ inf {t>Qf: X^(t) G i?(o, ^/^^Vl) }, 

with the convention that inf = +oo. We shall use the following lemmas, which will 
enable us to describe how X^ wanders around in T(L), independently of whether it ever 
hits or not. 

Lemma 6.5 There exist a function g : M-^- — )• vanishing at infinity, Cq > 0, Uq > 1 
and Lq £ N such that for every L > Lq and u > Uq, 

sup F^^^ [qf > pLu] < g{u) ff pi = 0{i;l), 

x(iB{0,mB)\B(0,(J /A)RB) 

sup F^^, [gf > i^lu] < \i pl^^l^Q. 

x£B{0,mB)\B{0,{7/4)RB) iOgU 

Lemma 16.51 will give us good control of the probability of a long excursion outside 
B(0,(3/2)i?^VL). 

Lemma 6.6 Suppose that 

Leb({r G [0, i?^] : i^^ i {5o,5i}}) > 0. (32) 

Then, there exists a constant Cq < oo such that for each L > 1, 

sup — E^^^[Qf] < Cq. 
x£B{0,i3/2)RB) PL 
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Condition (j32p guarantees that, whenever hits 0, it has a chance not to remain 
stuck at this value for all times. Lemma 16.61 then tells us that starting within 
B{{3/2)R^tPl) needs an average time of order 0{pl) to reach distance {7 /^R^ipL 
from the origin. 

Lemma 6.7 Suppose that PLi^J^"^ remains bounded as L ^ oo. Then, there exists 
Oi S (0, 1) such that for every L > 1, 

inf F^rxlX^ hits before leaving B(0, [7 U)R^ ipA] > 9i. (33) 

If lim inf L^rx) pj^^ipf^ = 0, there exist 62 G (0, 1) and 9^ > such that 
inf W^rxlX^ hits before leaving B (O, (7/4)i?^V'L)l 

>^2(l-exp{-e3|i}). (34) 

The proofs of these lemmas are given in Appendix IbI 

The following technique is inspired by that used in Cox & Durrett (2002) and Zahle 
et al. (2005), although the motions of the lineages and the mechanism of coalescence 
here are more complex and require slightly more work. Our plan is first to find a good 
lower bound on the number of times the lineages meet at distance less than {3/2)R^ipi 
(and then separate again) before time ^lPl- In a second step, we use the estimates 
on the probability that during such a gathering the lineages merge before separating 
again derived in Lemma IHTfl and obtain that coalescence does occur before ^lPl with 
probability tending to 1. For the sake of clarity, we show (j30p in the case where pii^^^ 
remains bounded, and then comment on how to adapt the arguments in the general 
case. 

Assume first that Condition (p2]l holds. Recall the definition of and given 
above, and define fc^ by 

A;L = niax{n: Qn <^lPl}- 

By Lemma 16.71 there exists a positive constant 9i such that for every L > 1 and 
xeB{0, {3/2)R^i;L), 

F^[X^ hits before leaving B{0, {7/i)R^^L)] > Oi- 

Hence, for every x G B{{),2R^iI>l) , we have 

^x[tL>^LPL\ <P4tL>QfcJ <E4(l-0i)*^^]. (35) 

Let us fix X G B(0,2R^'tp£^^ and show that /c/, — )• 00 as L — )• 00, in P2,-probability. 
The fact that the bounds obtained below do not depend on 2; G -B(0, 2R^'ijjL) will then 
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give us the desired uniformity. Let M € N. We have 

■ M M-1 

Y^iQf - <ili) + E ^'i^ - Q^) > ^LPL 

M-1 



1=1 



M 
i=l 



Qi-1 > 



i=l 
^LPL 

2M 



i=l 



^LPL 

2{M - 1) 



(36) 



where the last inequahty uses the fact that at least one of the 2M — 1 terms of the 
sums on the second line must be larger than a fraction (2M — of the total time. 
Now, using the Markov inequality, the strong Markov property at time qf_i and then 
Lemma 16.61 we can write for each i 



Qt 



> 



^LPL 

2M 



2M 

— mq^ 



Qi-i\ 



< 



< 



^LPL 

— — sup ¥.^^y [Q 

'i^LPL yeB{0,{3/2)RB) 



2MCq 



If we now apply the strong Markov property to at time and use Lemma 16.51 
together with the fact that X^{Qf) E B{0,AR^iPl) with probability one, we obtain for 
each i, and L large enough 



Qf> 



^LPL 

2{M - 1) 



Coming back to (|36p . we arrive at 



P^fci < M] < 



2M^Cq 



+ {M- l)g 



<9 



2{M - 1) 



0, 



as L — )■ oo. 



To complete the proof of (a) when Condition ([32]) holds and /OlV'l^ remains bounded, 
let e > and fix M = M(e) G N such that 

(1 - < s. 

Splitting the expectation in (f35]l into the integral over {k^ > M} and {k^ < M} yields 
limsup sup ^x[tL>^LPL] <e + limsup sup P2;[/cl<M] = e, 

and since e was arbitrary, the desired result follows. 

When Condition (p2]) is fulfilled but Pl'4'Z'^ is unbounded as L — )• oo, we can apply 
the same technique to obtain ()30p . This time, using the second result of Lemma 16.71 we 
can write as in ([35]) that, for every x E B(^0,2R^'ipLj , 

r / , . o/,2 . . \ ^Li 



l-..(l-exp{-.3|})) 
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The same arguments as above (using the second part of Lemma l6.5p yield, for L large 
enough, 



sup 

xeB{0,2_RS,/,i) 



kL<M^ 



2CqM^pI C.Mpi 



iji<^L V'ilog(^L/2M)' 



which tends to as L tends to infinity by our assumption of {^l)l>i- We conclude in 
the same manner, using the fact that when ipf^/pL — )• oo, 



2 . X \ AfpL/l/>2 



Let us finish the proof of (a) by removing the assumption (j32p . In the preceding 
proof, the main idea is that each time passes through B(^0, {3/2)R^iPl) , the two 
lineages have an opportunity to try to coalesce and their success probability is bounded 
from below by the quantity obtained in Lemma 16.71 However, if we do not assume 
that ([32]) holds, may become stuck at once it has hit it, and so the number ki 
of such sojourns in i3(0, {3/2)R^iPl) may be finite. This makes our arguments break 
down. Nevertheless, X^ can only hit through a coalescence event, and so this issue 
is merely an artefact of the technique of the proof. To overcome it, let us increase the 
rate of reproduction events by a factor 2, but divide each probability to be affected by 
2. Overall, coalescence will take a longer time in this new setting, but the motions of 
the lineages before their coalescence time will remain identical in distribution. 

More precisely, assume that (j32p does not hold. Define n£ as a Poisson point 
process on M x T(L) x (0, oo), independent of n£ and and with intensity measure 
2{pLip'j^)~^dt (g) dx (8) jjL^{dr), and for each r > such that = 5i, set = 5ii2- Let 
also n£ be a Poisson point process with the same distribution as and independent 
of all the other point processes. Call X^ the process defined in the same manner as 
X^ but with (resp., n|^, u^) replaced by Il£ (resp., z>^). By computing the 
intensity of the jumps of a single lineage, one can observe that it is equal to 

/ 2 L ix] /■■^^ L fx) 

\Pl J\x\/2 2vrr ' J\x\/2J0 

which is precisely that of Here, Lr{x) stands for the volume of B{0,r) n B{x,r). 
If we now compute the coalescence rate of two lineages at distance z G [0,2R^iPl], 
we obtain the same term due to small events for X^ and X^, to which is added the 
respective contributions of large events 

PL Jz/2 2.pL Jz/2 



Hence, the evolutions of both processes follow the same law outside B{0,2R^7pL), the 
contribution of large events whose area encompasses only one of the two lineages is 
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identical even within B{0, 2R^iI>l), and coalescence occurs at a higher rate for than 
for . This gives us for every L > 1 and x G T(L), 

where is defined in an obvious manner. But Condition ([32]) holds for X^, and so we 
can use the result obtained in the previous paragraph to complete the proof of (a) when 
(1321) does not hold. 



Case (6). The arguments are essentially the same. First of all, since we assumed that 
PL grows to infinity as L — )• oo, and because 

whenever > we can restrict our attention to sequences (^*l)l>i such that 
P2^^l — ?• as L — )• oo. Let £l denote the event that no large events affected any 
of the lineages before time Let Omax £ (0,oo) be such that the maximal rate at 
which at least one of the two lineages of the sample is affected by a large event is less 
than OmaxPi^ (recall that the total rate at which at least one of two lineages is affected 
is smaller than twice the corresponding rate for a single lineage, which is finite and 
independent of the location of the lineage). For each L E N, define e/, as an exponential 
random variable, with parameter O-iaaxPj} ■ By our assumption on (^i^ we can write 

^x[£i] < IP[eL < «>l] = 1 - exp I - ^"^J^^ I ^0, as L ^ oo. 

The distribution of the process X^ up to the first time at which it is affected by a large 
event is equal to that of X^ (defined as the process experiencing only small events) up 
to the random time e{X^), so that if Pl^6b,l{x) is the rate at which at least one of 
two lineages at separation x £ T(L) is affected by a large event, then for each t >0 and 

y G T(i) 

P,[e(X^) > t] = E, exp I - (^B,L{^^^i^)) 

By the definition of Omax, for each L G N the variable cl is stochastically bounded by 
e{X^). Consequently, if denotes the coalescence time associated to X^ (or, more 
precisely, to the model where lineages are affected only by small events), we have for 
each X G B{0,2R') 

Px[tL > <^l] <P4tL > ^l; £l] +Px[^£] 
<M^l>^l]+o{1) asL 



oo, 



where the remaining terms converge to uniformly in x G '^(L). Then, an easy modi- 
fication of the proof of (a) with "V'l = Pl = 1" yields the desired result and completes 
the proof of Proposition 16.41 □ 
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We can now turn to the proof of Theorem [ 
Proof of Theorem \6.3[ 

Cases (a) and (6). For (a), let us define for each L € N by 



$7 



Let t > and {Al)l>i be such that Al G r_4(L,2)* for each L G N. Introducing the 
time Tl needed for the two hneages of the sample to come at distance less than 2R^ipL, 
we can write 



Pa, 



(1 - a) plL'^ log L 
IL > : TT—^ t 



tL > 



27ra|Vi 
(1 - a)pLL'^ logL 



{l-a)pLL'^ logL 



■ {l-a)pLL'^ logL (1 -a)pL-^^^ log^ , ^ ' 
tl > — 9-79 t, 1l < ^ t - <Pl 



(37) 
(38) 



Using the strong Markov property at time and the uniform convergence derived 
in Proposition I6.4f a). we obtain that the expression in (|38p tends to as L — )• c« 
independently of the choice of t > and (^l)l6n- For ([57|) . note that 



Ar 



{l-a)pLL'^ logL (1 -a)pj.L^ logL 
> 9—79 t] J^L > ; ^^—^ t 



■ (l-a)pLL^ logL 

iL > ^ ^^-^ I 



(1 - a)pLL'^ logL 
2^a|^i 



(1 - a)pLL'^ logL 



(39) 



2vra|V'i 

Since (defined at the beginning of Section I6.ip has the same law as {^^{2t),t > 0} 
until the random time Tl, we can bound the quantity in (j39p by working directly with 
the latter process. In order to apply Lemma [531 to {^'^^^^(2/?^^), t > O}, with 

(l-a)L2logL L^ 



we need to check that Ul'4>\L~'^ 



UL 



2pL 2^llogL 



and R = 2R 



B 



00 and ul < 



(recall that this pro- 



cess evolves on the torus of size ipJ^^L.) Both conditions are fulfilled here, and so by 
Lemma EH the right-hand side of ([39l) is bounded by 



c 







as L 



00. 



PlL2 log L 

Hence, coming back to (|37p . we can use the result of Proposition 
in t > and {Ai)iyi of our estimates to obtain 



lim sup sup 

L-s>oo t>0 Ater^(L,2)* 



(1 - a)pLL^\ogL 

*i > n 2~72 * 



and the uniformity 



0. 
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The proof of (6) follows exactly the same lines, with = L^(log L) ^ and Lemma[53] 
applied to tpl^ {2^1)1-) . 

Case (c). In contrast with the two previous cases, where coalescence in the limit is due 
to large events only, here the pair of lineages can coalesce only through a small event. 
To see this, let us define T£ as the first time at which the two lineages (indexed by L) 
come at distance less than 2R'^ from each other, and tl as the first time at which at 
least one of them is affected by a large event while they are at distance less than 2R^iI)l 
(i.e., while G B{0,2R^'il)i)). Note that for each L, T£ and tl are stopping times 
with respect to the filtration {Tt, t > 0} associated to U n£ as we trace backwards 
in time. In addition, define as the entrance time of into B{0, 2R^) and as the 
first time makes a jump of size O^tpi) while it is lying in B{{),2R^iJjl). These two 
random times are stopping times with respect to the filtration {Tt, i > 0} associated 
to We claim that for each L E N, 

{X^(t), t<TL^ n] {e^(2t), 2t < A f£}, (40) 

where the notation refers to equality in distribution. Indeed, as long as has 
not entered B{0,2R^) and no large event has affected it while it lay in B(0,2R^iPl), 
coalescence events are impossible and the rates and distributions of the jumps of both 
processes are identical. We cannot include the terminal times in (|40|) since the values 
of the processes will differ if r/, A T£ = and the corresponding event is a coalescence, 
but since X^ and are jump processes with finite rates, we can easily see that the 
event {tl ATI = tl] (resp., ATJ = tl) is J^{rLATi)- (resp., -F(^^^f£)_) -measurable. 
Hence, for each L £ N, A = p2{xi,X2) and x = xi — X2 £ l'(^)) we have 

¥A[TL<Tl]=¥^[fL<fl]. (41) 

Let us now bound the right-hand side of (jlT]) under the assumption that (/?^^'(/;i)LeN is 
bounded. Analogous computations to those in the proof of Proposition 16.21 show that 
{^^{2t),t > 0} itself satisfies Assumption 15.21 with (t| = 2ag + o(l) as L — oo. Hence, 
Lemma E3] applied with di = 2R^ gives us 



lim sup sup 



■ XL 



~ L^logL 



0. (42) 



Let 

^max < OO be such that for every L E N, the rate at which makes a jump of size 
0{iPl) is bounded by Omax/PL- Fixing e > and K > such that e~^'^'^^^ < e, we have 
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for L large enough and any sequence {xl)l>i such that xl G r(L, 1) for every L: 



= P,.^ [fL <fl< KL^ log L] + P,^ [fL <fl;fl> KL" log L] 
< P,^ [tl < KL" log L] + P,^ [fl > KL" log L] 



< E, 



1 — exp 



{^max /" 
PL io 



KL'^ logL 



(^^(2s))(i5} 



+ e. 



(43) 



Splitting the integral below into the sum Jq 



and using the four results of Lemma [5^ there exists Lq G N, and ai, 02 > independent 
of L, {xl)l>i and K > 0, such that for every L > Lq, 

- pKL^logL 

. Jo J 
Hence, the first term on the right-hand side of (j43p is bounded by 



< {ai+a2K)i;l\ogL. 



KL^logL 



lB{0,2i?B^i)(C^(2s))(is 



< 6'max(oi + a2K) 



PL 



PL JO 

which tends to as L — )• 00, independently of the sequence {xl)l>i considered. As e 
in (|43p is arbitrary, we can conclude that 

lim sup P^^ [tl < fl] = 0, 

^^'^ XL&r{L,i) 

and by (j^Tj) . the same result holds for and any sequence {AL)Le'M such that € 
r_4(L, 2)* for every L. In words, we have obtained that with probability tending to 
1, any pair of lineages starting at distance 0{L) from each other gather at distance 
2R^ before having a chance to coalesce through a large reproduction event. By using 
the same method as in (a) but this time with the result of Proposition 16.41 (6) and with 
Proposition 16. 21 replaced by (j42|) . we obtain the desired conclusion under the assumption 
that (p^"'^Vi)L6N is bounded. 

When PL S> L^logL, with probability increasing to 1 no large events at all affect 
any of the lineages by the time they are gathered at distance 2R'^ by small events. The 
result then follows from the same arguments, with replaced by the motion of a single 
lineage subject to only small reproduction events. □ 



Remark 6.8 Let us comment on the cases not covered by the theorem, that is ip'^ ^> p^, 
PL is of order at most L?\ogL and p^^'^^'i log ^Oj^ a finite limit (possibly Oj. When 
the latter limit is positive, from the results obtained so far coalescence events due to 
small and to large reproduction events occur on the same timescale and depend on the 
precise paths of the two lineages. Therefore, we do not expect t^ to be exponentially 
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distributed (with a deterministic parameter). When pj^^ipf^log L tends to 0, the same 
reasoning as in the proof of (c) gives us that the probability that a large reproduction 
event causes the two lineages to coalesce before a time of order log L vanishes as 
L —7- oo. However, does not satisfy the conditions of Section \^ (Assumption \ 5.^) 
as it does when the assumptions of (c) hold. Using instead £^ = ilj~[^X^(ip'j^-), the time 
needed for the lineages to come at distance less than 2W translates into T{i^ ,2R^ /tPl), 
which is not covered by Lemma 15.31 and requires estimates of the entrance time of the 
jump process into a ball of shrinking radius, which we have been unable to obtain. 

6.2 Convergence to Kingman's coalescent 

To complete the proof of Theorem 13.31 we now turn to the genealogy of a finite sample, 
starting at distance 0{L) from each other on T(L). 

We can already see from our analysis for a single pair of lineages that our spatial 
A-coalescent is similar in several respects to the coalescing random walks dual to the 
two-dimensional voter and stepping-stone models with short-range interactions (see e.g. 
Cox & Griffeath 1986, 1990 for a study on Z^, and Cox 1989 or Zahle et al. 2005 for 
examples on the torii T(L)nZ^). It will therefore be no surprise that the analogy carries 
over to larger samples. In most of the papers cited above, the authors are interested 
in the sequence of processes giving the number of blocks in the ancestral partition. 
They show that, when the initial distance between the lineages grows to infinity, the 
finite-dimensional distributions of these counting processes converge to those of a pure 
death process corresponding to a time-change of the number of blocks of Kingman's 
coalescent. In Cox & Griffeath (1990), more elaborate arguments yield the convergence 
of the finite-dimensional distributions of the unlabelled genealogical processes to those of 
Kingman's coalescent. Instead of adding a new instance of such proofs to the literature, 
we shall simply explain why the same method applies to our case. This will also enable 
us to prove the tightness of the unlabelled genealogical processes. 

Proof of Theorem \3.iA (i) Convergence of the finite-dimensional distributions. 

We follow here the proofs in Cox & Griffeath (1986) (for the number of blocks of 
the ancestral partition) and Cox & Griffeath (1990) (for the unlabelled genealogical 
process of a system of coalescing simple random walks on 1?). Notice that, since we 
work on the torii T(L), our rescaling of time differs from Cox and Griffeath's. Another 
significant difference is the fact that, in their model, lineages move independently of each 
other until the first time two of them are on the same site, upon which they coalesce 
instantaneously. In our setting, the movements of lineages are defined from the same 
Poisson point processes, and two lineages having reached a distance that enables them 
to coalesce can separate again without coalescing. 

Despite these differences, Lemma [6.9l below shows that a key ingredient of their proof 
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is still valid here: at the time when two lineages coalesce, the others are at distance 
0{L) from each other and from the coalescing pair. To state this result, we need some 
notation. Let Tij be the first time lineages i and j come within distance less than 2R^ipL 
(resp., 2R^) if pi <^ Tpf^logL (resp., pL ^ Tp'j^logL) and r be the minimum of the Tj/s 
over all pairs considered. Let also t*j be the coalescence time of the ancestral lines of i 
and j, and r* be the minimum of the t*^ over all lineages considered. Finally, for each 
i we shall denote the motion in T(L) of the block containing i by 

Lemma 6.9 Under the conditions of Theorem \3.3\ we have 



lim sup ¥a 
lim sup Fa 



logL 



0, (44) 
0. (45) 



The proof of Lemma 16.91 is deferred to Appendix [Bl 

The other ingredients required to apply Cox and Griffeath's techniques are a control 
on the probability of "collision" for two lineages during a short interval of time, obtained 
here in Lemma [5.5| and the uniform convergence of the coalescence time of two lineages, 
which constitutes our Theorem 16.31 With these estimates, one can obtain the limiting 
rates of decrease of the number of blocks of A^'^ (namely those of the number of 
blocks in Kingman's coalescent), and the fact that mergers are only binary as in Cox 
& Griffeath (1986). In particular, the counterpart of their Proposition 2 here gives us 
that for each n G N, 

lim sup sup [l-4^'"(OI = n] - exp I - '^^^ ~ A = q, (46) 

which we state here because we shall need it for the case a = 1 (observe that our L 
corresponds to their t). Note that in Proposition 2 of Cox & Griffeath (1986), the 
right-hand side of their equation gives the probability that the number of blocks is less 
than n, instead of equal to n as it is stated. Furthermore, in ()46p the supremum is over 
t > instead of t G [0,T] for some T > (as in Cox & Griffeath 1986). Our argument 
for this modification is the fact that the two quantities we are comparing are monotone 
decreasing in t and both tend to 0. 

Then, the same arguments lead to the proof that any pair of lineages is equally likely 
to be the first one to coalesce, as in Lemma 1 of Cox & Griffeath (1990). The unifor- 
mity of the estimates obtained enables us to proceed by induction to show the uniform 
convergence (on a compact time-interval) of the one-dimensional distributions of A^'^ 
to those of /C, which translate into the uniform convergence of the finite-dimensional 
distributions, still on intervals of the form [0,T]. We refer to Cox & Griffeath (1990) 
for the complete proof of these results. 



39 



(ii) Tightness. 

This follows easily from the fact that the labelled partition with initial value 
in r^(L,n)* for some n G N lies in T^{L,n) immediately after each coalescence event, 
with probability tending to 1. Indeed, for each L G N, let 7^ < . . . < 7^_i be the 
ranked epochs of jumps of A^'^ (if less than n — 1 jumps occur, then the last times 
are equal to +00 by convention). Let also n G N, G Tj[{L,n)* for every L > 1, 
and following Ethier & Kurtz (1986), for every d,T > let w'{A^''',T,5) denote the 
modulus of continuity of the process A^'^ on the time interval [0, T] and with time-step 
6. Let e > 0. With the convention that (+00) — (+cxd) = +00, we have 

n 

Fa,[w'{A'',T,6) >e]< Y.fA,b^-lti < (47) 

k=2 

An easy recursion using the fact that we consider only finitely lineages and the uniform 
bounds obtained in Lemma \Q3\ enables us to write that for all A; G {1, . . . , n — 1}, 

sup P^' [y^ <oo ; A^{wLJk) ^ '['a{L, n)] 0, as L 00. 
A^er^{L,n)* 

This result and an application of the strong Markov property at time 7^_^ yield 

(n — k)(n — k — 1) ™ r r ,-, , ^ , 

<- '\ sup F^^ bf <5]+o{l) (48) 

as -L — )■ 00, where the last line uses the consistency of the genealogy to bound the 
probability that a first coalescence event occurs to the sample of lineages before 5 by the 
sum over all pairs of lineages of this sample of the probability that they have coalesced 
by time 5 (note that there are at most {n — k){n — k — l)/2 possible pairs just after 
Ik-i)- But these probabilities converge uniformly to 1 — by Theorem 16.31 and so 
for 6 small enough, we can make the right-hand side of ()48p less than e/(n^) for L large 
enough (n is fixed here). Coming back to (|47p . this gives us 

limsupPAi[w'(^^,r,(5) >£]<£. 

Since Vn is a compact metrisable space, we can apply Corollary 3.7.4 in Ethier & 
Kurtz (1986) to complete the proof. □ 



7 Proof of Theorem 13.7 



We now turn to the case ipi oc L. We still have small reproduction events of size 0(1), 
but now large events have sizes 0{L) (and rate 0{p~[^)), so that they cover a non- 
negligible fraction of the torus T(L). By Lemma 15.41 if the lineages were only subject 
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to small reproduction events, the location of a single lineage would be nearly uniformly 
distributed on T(L) after a time ^ L^- This suggests several limiting behaviours for 
the genealogical process , according to how pi scales with L^: 

• If is order at most 0{L'^), then large reproduction events occur at times when 
the locations of the lineages are still correlated with their starting points, and so 
we expect space (i.e., labels in the representation we adopted) to matter in the 
evolution of A^. 

• If <C PL L^^ogL, then the lineages have the time to homogenise their 
locations over T(L) before the first large event occurs, but not to come at distance 
2R^ from each other. Hence, large events should affect lineages independently of 
each other, and bring the genealogy down to the common ancestor of the sample 
before any pair of lineages experiences a coalescence due to small events. 

• If PL ~ log L, the fact that pairs of lineages have now the time to gather at 
distance 2R'^ should add a Kingman part (i.e., almost surely binary mergers) to 
the genealogical process obtained in the previous point. 

• If PL ^ L^logL, Kingman's coalescent due to small reproduction events should 
bring the ancestry of a sample of lineages down to a single lineage before any large 
event occurs, so that the limiting genealogy will not see these large events. 

Proof of Theorem \ 3. T[ For (a), let us write down the generator Ql of applied to 
functions of the T(l)-labelled partitions of {1, . . . ,n}. Recall the notation Xa for the 
label of the block a of a labelled partition A (introduced in Notation [23]), and write 
1^1 for the number of blocks of A. For each L > 1, / of class with respect to the labels 
and A ^ Vn such that any pair (01,02) of blocks of A satisfies \xa-i — XqjI > {^Ft^)/L, 
we have 



gU-{A) = pLy2l dy [ ^,^{dr)^ [ u^Mu)u 
f^JTiL) Jo Trr^ Jo 

x[f(^A\ {{a,,xa,)} U { (a„ x,, + f ) }) " f{A)] + G^'^HA), (49) 
where we wrote A = {(ai,Xai), . . . , (a|^|,2;a|^|)| and 
g(^) (A) 

= -a i dz p^{dr) — ^ Yl^x,€B{z,cr)}Yl^x,m^,cr)} 

^ -^^(1) Jo JBiz,cr) ^-,c{l,...,m}ie/ j^I 

E /' (1 - ^f^~^V{du) \f(A\[ U{(«- ^aj}) u { ( U y) }) - /(^) 



X 

Jci- 
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is the generator of the coalescence events due to large reproduction events (recall Vr is 
the volume of the ball i?x(i)(0, r)). Note that G^^'' does not depend on L. Let us look 
at a particular term in the sum on the right-hand side of (|49|) . Since / is of class 
with respect to the labels of the blocks, a Taylor expansion and the symmetry of the 
jumps due to small events give us 

PL I dy r fi%dr)^ /"^,(dn)tx[/(A\{(a„x,J}u{(a„x,^ + |)}) -/(^) 

where Aj is the Laplacian operator on T(l) applied to the label of the block m only. Since 
PlL~^ — 6 G [0, oo) by assumption and because / is continuous on a compact space, we 
obtain that Ghf defined on the compact set El = {^4 G '■ L\xai ~Xaj\ > 2i?* Vi 7^ j} 
converges uniformly towards 

1=1 

Now, by the same technique as in Section [5l one can prove that the gathering time at 
distance 2R'^ of two lineages starting at distance 0{L) on T(L) and subject only to small 
events converges uniformly on the time scale ^ ^°f^ to an Exp(l) random variable (in 
the sense of Lemma l5.3p . In addition, since the new location of a lineage affected by 
a large event is chosen uniformly over a ball of T{L) whose radius is of order 0{L), if 
a large event affects a pair of lineages but does not lead to their coalescence, then the 
probability that the lineages are at distance less than L(logL)^^ just after the event 
vanishes as L — )• 0. If we call T£ the first time at which two lineages on T(L) are 
gathered at distance 2R'^ and t*^ their coalescence time in the original timescale, we 
readily obtain that for any u > 0, and x'l 7^ x'2 G Tr(l)^, 

hm Pp,(L.;,L4) [tl >tl;tl< pLu] = 0. 

Indeed, as we already mentioned, if a large event does not make the lineages coalesce 
then with probability tending to one, the latter start at separation 0{L) and do not 
have the time to meet at distance 2i2* before the next large event. Now, the number 
of large reproduction events that the pair of lineages experiences before time plu can 
be stochastically bounded by a Poisson random variable whose parameter is finite and 
independent of L. Hence, if none of them leads to a coalescence then with probability 
tending to 1, T£ > piu. It follows that, if n > is fixed, we can use the consistency of 
the genealogy and write 

n 

Pp„(ix)[3t G [0,n] : ^^(t) ^ i?^] < IP{(W,l.,),({j},l.,)} > ^L^'^t < PLu] ^ 0. 

i<j=i 
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Consequently, one can use Corollary 4.8.7 in Ethier & Kurtz (1986) (with El as the 
subspace of interest in condition (/)) to conclude that the law under Pp„(Lx) of 
converges to that of A°°'''''^ as processes in the Skorohod space of all cadlag paths with 
values in the T(l)-labelled partitions of {1, . . . , n}. 

Let us now prove (6). Recall the assumption that the total rate at which large events 
occur is finite, that is M = c~^fi^{[0, (v^)^-*^]) < oo. Let us first analyse what happens 
during the first event which may affect the unlabelled ancestral partition. 

Define for each L > 1 the stopping time by the following property: pLef is the 
first time on the original timescale at which either a large event occurs, or A^ undergoes 
a coalescence event due to small reproduction events. Since large and small reproduction 
events are independent, Pl^i has the same distribution as the minimum of two following 
independent random times: 

• the first time of occurrence of a large event, that is an Exp(M//)L)-random vari- 
able. 



the time t*^ at which a first coalescence event occurs between lineages of the 
genealogical process A^ evolving only owing to small reproduction events. 



By P6|) applied to the case pl = +oo (i.e., no large events occur), ^^^^^^ converges 
to an Exp(n(n — l)/2)-random variable under P^l! uniformly in {AL)LeN such that 
Al G r_4(L,n)* for every L. It is then straightforward to obtain 

n(n — 1) 1 



lim sup sup 



exp 



M + f3- 



(50) 



where the formulation is also valid for /3 = 0. Also, by the independence of and 11^, 
for every {AL)L£f>s as above we have (with an abuse of notation) 



exp 



PL J 



Using Fubini's theorem and a change of variable, we can write 



exp 



— tl\ = Fa, exp - — 

PL J -I Jo L PL J 



Pa, 



L2 log L 



M 

PL 
tl< 



> S 



ds 



2^^ipL 
ML2 log L 



log s 



ds 



ML^ log L 
2vrcj2pL JO 
ML^ log L 
2-KalpL 



e 



1 



ML^ log L . 



L2logL 



du 



t} > u 
L2 log L ^ - 



du. 



When ;g > 0, we have ^^"^^ 



^ and so we can use the uniform convergence derived 



m 



and the fact that the distribution of t*^ does not charge points to conclude that 

/3n(n — 1 



lim sup 

Ai6r^(L,n)* 



Pa^ {pLi 



tl 



f3n{n -1) + 2M 



0. 
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The limit holds also for /3 = by a trivial argument. A byproduct of this result is the 
existence of a constant Cq > and Lq G N such that, for all L > Lq and {AL)L£m as 
above, PA^i/OLef < t*^] > Cq. We shall need this fact in the next paragraph. 

By Theorem 13.31 in the case = oo, up to an error term tending uniformly to 0, on 
the event {piSi = t*i} the transition occurring to A^'^ at time pLcf is the coalescence 
of a pair of blocks, each pair having the same probability to be the one which coalesces. 
Let us show that, conditioned on {pL^i < ^l}, the locations of the lineages at time 
{pL^i )— are approximately distributed as n independent uniform random variables on 
T(L). We use again the notation Tij,T*j and r, r*(= here) introduced in the proof of 
Theorem 13.31 for the gathering time at distance 2R^ and the coalescence time of lineages 
i and j, and their minima (once again on the original timescale). These quantities 
depend on L but, for the sake of clarity, we do not reflect that in our notation. In order 
to use our results on Levy processes, we need to make sure that no pairs of lineages 
have come at distance less than 2R^ before time Pl^-i- We have for each L G N 

n 

Fa^ [t < pLe{\ pLe{ <tl]< [nj < pief] pie^ < tl] , (51) 

i<j=i 

Each term on the right-hand side of ()5ip is bounded by 

[ Tij < pief - log L I pief < tl] 

+Fal [piei - logL < Tij < pLe{\ pie^ < tl] 
< Cq-i {Pa^ [f *• > fij + log L] + ¥a^ [fij £ [?L - log L, <^l)] } , (52) 

where for each L G N, ?l is an Exp(M//9i)-random variable independent of all other 
variables, and fij and f*j are defined as above, but for the process A^. By the strong 
Markov property applied at time fij and the result of Proposition 16.41 (b) . the first term 
on the right-hand side of ()52p converges to uniformly in Al £ r_4 (L,n)*. By a simple 
change of variable, the second term is equal to 

M e-^'' Fa, [nj G [pls - log L,pLs)]ds < M e~^'' C ds ^ 0, 

where the inequality comes from Lemma 15.51 Therefore, back to ()5ip we obtain that 

lim sup Pa^ [t < pLe{\ pLe{ < tl] = 0. (53) 

Now, let Di, . . . , Dn be n measurable subsets of T(l), and for each i G {1, . . . , n} and 
L > 1, let LDi C T{L) be the dilation of Di by a factor L. Let us show that 



lim sup 



{^f,...,Cn)ipLef-) G [LDi] X ... X {LDn)\pLef <tl 

n 

-J]Leb(A) =0, (54) 
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where ■^f'(t) denotes the location of the i-th hneage of at time t. To do so, let us use 
the fact that on the event {pl^i < t^}' genealogical process up to time plGi 
has the same distribution as A^ up to time ?l and on the event {f* > We have 



Pa, 



1 




<tl] 


1 








1 




PAjpLef 
M 









(ef , . . . , in){PLe{-) G J{{LD,)\pLe{ < tl\ 

i=l 

n 

[(ef , • • • , ^nXPLcf-) e YliLDi); pLcf < tl 

i=l 

n 

(lf,...,e^)fe-)en(^^'); '^L<f* 
1=1 

n 

(If, • • • , e YliLDi); ,L<f\+ 7^l{Al) 



i=l 



ds e-^'TA 



i=l 



T > PLS 



+ Vl{Al), 



(55) 



where ?7l(^l) tends to uniformly in (AL)LeN by (f53]) and the fact that Fa^ [plGi < t^] 
does not vanish. 

Let us fix s > for a moment, and consider the corresponding probability within the 
integral. Up to time f, the movements of the lineages are distributed as n independent 
copies If, • • • of the motion of a single lineage, for which an easy modification of 
LemmaEH (6) tells us that, if (eL)LGN is such that ez, — )• but elPl ^ as L — )• oo, 



lim sup sup 



S'^ivpL) G (LD)] -Leh{D) 



0. 



(56) 



However, it is not entirely clear that this convergence will still hold for n independent 
lineages on the event {f > pLs} (where f is the first time at which at least two of them 
come at distance less than 2R^). Keeping the notation Al for the initial value of the 
set of lineages and denoting the set of n (non-coalescing) motions by A^, we have 

Fa^ [(If, . . .,i^){pLS-) G (LDi) X ... X (LDn); f < pls] 



(If, • • ■,in){iPLS - r) - ) G (LZ^i) X ... X (LDn) 
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Splitting the preceding integral into {pl{s — £l) < t < Pls^ and {f < pl{s — gl)}, we 
can use (l56l) in the latter case to write 



i=l 



^{PL{s-eL)<r<PLs} ^A^{t) 



\i{,...,i^)[{pLS-r)-)eJ{{LDi 



i=l 



n 

+ (j]Leb(A))PAjf < PL{s-eL)\ + 5L{AL) 



(57) 



i=l 



where {6l{Al))i^^ tends to zero uniformly in [AL)i^fi as L tends to infinity (we still 
impose that Al £ Tj[{L,n)* for every L). By the convergence of the distribution 
function of ^2 to that of an exponential random variable, uniformly in the time 
variable and in {Al)l^^, we obtain that Pa£[pl(s — ^l) < t < pis] converges to 
uniformly in (j4i)igpj (which is also true if /3 = 0, i.e., pi <^ L^logL). Hence, we can 
find a sequence {5'^{Al)) Len decreasing to uniformly in (^L)LeN, such that the whole 
sum on the right-hand side of (f57|l is equal to 



n 

(nLeb(A))PAjr < pls] + 6'l{Al) 



1=1 



Likewise, we can find another sequence {6'l)L(zfq decreasing to zero uniformly in (A/,)^^^ 
such that 



^A, [iii, ■ ■ ■An){PLS-) G {LD^) X ... X {LDn)] = nLeb(A) + 51{Al) 
Subtracting the two last equalities, we obtain 



j=i 



Pa, 



{i{,...An){pLS-)£\{{LDi)-f> pLs\ = \ nLeb(A)[PAjT>pLs] + o(l), 



i=l 



i=l 



where the remainder decreases to uniformly in s > and (^l)l>i such that Al G 
r_4(L,n)* for each L. Coming back to ()55p . we obtain that it is equal to 



M 



¥AAPLe{<tl] 



ds e-''' I ( f[ Leb( A)) ^A, [r > pls] + o(1) | 



PAjT>a] 



nLeb(A)+o(l) 



i=l 

PAjT*>a]+0(l) 



nLeb(A) + o(l), 



i=l 



where the last line uses (|53]l . We can thus conclude that (|5ll) holds. 
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Condition on the first event being a large reproduction event. By the description of 
such an event, the result for the genealogical process is the merger of at most one group 
of blocks into a bigger block. Furthermore, the transitions depend only on the number of 
blocks and their labels, so for convenience we derive the transition probabilities for Al 
of the form Pn(x) only, although we shall use the result later for more general labelled 
partitions. Let vr be a partition of {1, . . . ,n} such that vr has exactly one block of size 
greater than 1, which we call J. Then if the large event has centre x and radius cr in 
T(l), the probability that the transition undergone by A^'^ is p„ — t- vr is the probability 
that at this time, at least all the lineages in J have labels in B{x,cr) and are really 
affected by the event, and all the other lineages present in B(x,cr) are not affected 
by the event. Summing over all possible choices I C {1, . . . ,n} \ J for these "other 
lineages" (I can be empty) and using ([5^ . the probability of the transition p„ — )• vr up 
to a vanishing error is given by 

J Jo 

l-l 'r^-\J\ / _ I T|\ 



i=0 

\uV,rf\{il - u)V,r + 1 - VcrT-\'\yf[du) 







{uV,r)\'\{l - uV,rT-\'\v^{du). (58) 

We now have the results we need to show {h). For every L G N, let us consider again 
the time introduced earlier, and define for each integer i > 2, 

ef = inf {t > e^_i : pit £ Ilf or p^t is the epoch of a coalescence 
due to small events}. 

Let us also define similar times corresponding to A'-^''^^ . From the expression of its rates 
given in Definition 13.61 A^'^''^-' is composed of a Kingman part (i.e., only binary mergers) 
run at rate /3, and of a set of multiple mergers due to the part A'^'^^ of its A-measure 
with the atom at removed. Furthermore, the finite measure A(o) on [0, 1] is given by 

A(0)(df) = c-V / u^{{u : uVcr G dv})p^{dr) 

-2 2 
= C V 



J, l{v.,.>.}iy?[d—)p''idr). 



Following Pitman's Poissonian construction of a coalescent with multiple mergers (whose 
A-measure has no atom at 0, see Pitman 1999), let us define H as a Poisson point process 
on M.j^ X [0, 1] with intensity dt v~'^A^^^ (dv). Note that because of our assumption on 
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M, v~'^A^^\dv) is also a finite measure, with total mass M. The atoms of 11 constitute 
the times at which A^^''^^ acting on the partitions of N experiences a multiple collision, 
and the probabilities that any given lineage is affected by the event. The Kingman part 
of A^^''^) is superimposed on this construction by assigning to all pairs of blocks of the 
current partition independent exponential clocks with parameter /3, giving the time at 
which the corresponding pair merges into one block. 

From now on, we consider only the restriction of A*-^'^-* to Vn, although we do not 
make it appear in the notation. Let ei be the minimum of the first time a pair of blocks 
of A(/5''=) merges due to the Kingman part and of the time corresponding to the first 
point of n. Define in a similar manner for all i > 2, so that (ej)jgp^ is an increasing 
sequence of random times at which A^'^''^^ may undergo a transition. Our goal is to 
show that the finite-dimensional distributions of {(ef , ^^'"(ef )), i G N} under P^i^ 
converge to those of {{ei, A^^'^\ei)), i G N} under Pp^, as L — oo. Since A^''^ (resp., 
j\(/3,c)-j gg^j^ jump only at the times ef (resp., Cj), the fact that only finitely many jumps 
occur to A^^''^^ in any compact time interval, together with Proposition 3.6.5 in Ethier 
& Kurtz (1986) enable us to conclude that this convergence yields (6). We proceed by 
induction, by showing that for each i G N: 

H{i) : if ai £ T^{L,n) for each L and there exists ttq £ Vn such that for all L G N, 
bl(ai) = ttq, then as L ^ oo 

^P., ({(ef ,^^'"(ef )), . . . , (ef ,^^'"(ef ))}) £p,^ {{{e,, A^^'^'He,)), . . . , (e„ A('^'^)(e.))}). 

(Note that ai can have less than n blocks). 

Let us start by H{1). Let t > 0, it G Vn and write hq for the number of blocks of 
ttq. We have, in the notation used in the previous paragraph (and with A^'^ defined as 
the unlabelled partition induced by A^ on the timescale pl), 

Pa, [ef < t- ^^'"(ef ) = vr] 

= Pa, [ef < t; ^^'"(ef ) = vr; p^ef = tl] + P,^ [ef < t; ^^'"(ef ) = vr; p^ef < tl] 

= Pa, [tl < PLt; A'^^^itl/pL) = 7r;tl< Cl] (59) 

+Pa, [ef < t; ^^'"(ef) = vr| pief < t^] P^, [pLcf < tl] . (60) 

By Theorem l3. 31 applied with p^ = +oo, A^'^ with initial value converges as L — )• oo 
to Kingman's coalescent /C^^^ started at ttq and run at rate /3, as a process in D-p^ [0, oo) 
(if /3 = 0, then A^'^ converges to the constant process equal to vro). Hence, by the 
independence of A^ and C,l for every L and a simple time-change, the quantity in (|59p 
tends to that corresponding to K,'^l^\ that is 

P^o [/C(^) (ef ) = vr] P^„ [ef < t A C] , (61) 



48 



where ef is distributed like an Exp(/3 "'°*-"° ''^^ )-random variable and stands for the 
epoch of the first event occurring to /C(^\ and C is an Exp(M)-random variable. By 
the construction of A^'^''^^ given in the last paragraph, (I6ip is the probability that the 
first event occurring to A^^''^) happens before time t, is due to the Kingman part of the 
coalescent and leads to the transition vro — )• vr. For (j60p . note first that because n£ and 
n£ are independent, if we condition on pi^i being the time of the first point (ij", xf, r^') 
of n^, then and the pair (xf ,rf') are independent. Hence, we have for each L > 1 

Pajef < t ; ^^'"(ef) = 7r| PLef < tl] 

= Pa^ [ef < t\ pLe{ < tl]Fa, [^^'^(ef) = 7r\ pLe[ < tl] . 

Using ()50p and the same reasoning as for ()59p . we can write 

.[e{<t\ PLe{<tl] [pLe{<tl] 

exp { - (m + p'^^^^y] - [ef < t A C] 
[C<tAef], 



[ef < t- pLsf < tl] 



[ef <t]- Fa, [e{ < t; pLef = tl] 



■ ""0 I 



where the last equality comes from the fact that an Exp(/3 "'°^"2 +M) -random vari- 
able has the same distribution as the minimum of an Exp(/3 "°^"°~^) )- and an Exp(M)- 
random variables, independent of each other. In addition, by the calculation done in 

dSHl), 

Fa^ [^^'"(ef ) = ^1 pLe^ < tl] ^ [a(°) (ef ) = tt] , as L ^ oo, 

where Ci is the time of the first event of 11. Combining the above, and recognizing the 
transition probability of A^^''^^ through the decomposition obtained, we can write 

lim P,^ [ef < t; ^^'"(ef) = vr] = P^^ [ei < t; A^^'^\ei) = vr] . 

Since this result holds for each t > and ttq S Vn, using a monotone class argument 
we can conclude that the distribution of (ef , ^^''"(ef')) under P^^ converges to the 
distribution of (ei, A(^''^)(ei)) under Pttq as L — )• oo. This proves H{1). 

Suppose that H{i — 1) holds for some i > 2. Let D C t > and vri, . . . , vrj G 

Vn- Let also L € N. By the strong Markov property applied to at time pi&i-i, we 
have 



P,J(ef',...,ef_i) G A ef-eti<i; ^^'"(ef) = vn, . . . , ^^'"(ef ) = vr, 

L{{ef,...,ef_,)eD} lMi."(ef)=^i,...,^i>"(ef_,)=^._i} 
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First, using arguments analogous to those leading to Lemma 16.91 up to an error term 
vanishing uniformly in (0^)2,^^ such that ql £ r(L, n) for every L G N, we can consider 
that A^ipLet^) G r^(L,n). As hl{A^{pLeti)) = TTi^i for each L, we can therefore 
use -ff (1) to write that 

2im^P^.(^^,._^) [ef < t; ^^'"(ef ) = vr,] = P^,_, [ei < t; k^P^'\ei) = vr,] , 

and so dominated convergence and H{i — 1) give us 



lim P,^ [(ef , . . . , eti) ^ D; - e^i < t; ^^'"(ef ) = vn, . . . ,^^'"(ef ) = vr,] 

l{(ei,...,e,_i)GD} l{A(/3.<=)(ei)=7ri,...,A('5.<=)(e,_i)=7r,_i}IPT,_i [ci < t; A(^''=)(ei) = TTj] 

[(ei,...,e,_i) G L»; - e,_i < t; A^'^'^Hei) = tti, . . . , A^^^'^^ei) = vr,] , 



L— >oo 

= E, 



1"0 I 



which again yields H{i) by standard arguments. The induction is now complete, and 
so we can conclude that the finite-dimensional distributions of the embedded Markov 
chain and the holding times of A^'^ under P^^ converge as L — )• 00 towards those of 
^(/3,c) undgj. xhe proof of (6) is then complete. 

To finish, suppose that pL ^ L^logL. Then, we can find a sequence increasing 
to +00 such that 

sup ¥a[ a large event affects at least one lineage before time ^^L^logL] — )• 

as L — )• 00. Hence, we can couple A^ with the process A^ which experiences only 
small events, so that the time by which they differ at step L is larger than with 
probability tending to one, uniformly in the sequence {Al)l>i chosen as above. By 
the results obtained in Section [6] with pi = +00, we know that A^'^ converges in 
distribution towards /C, as a process in D-p^ [0, 00). Since the sample size n is finite and 
under Kingman's coalescent, a sample of n lineages reaches a common ancestor in finite 
time almost surely, (c) follows. □ 

A Proofs of the results of Section [5] 

Since the proofs of Lemmas 15.31 and 15.41 are highly reminiscent of those of Theorem 2 
and Lemma 3.1 in Cox & Durrett (2002), we shall only give the arguments we need to 
modify and refer to their paper for more extensive proofs. 

Proof of Lemma \5.4\ ' Since is a Levy process, for any integers n and L one can 
decompose i^{n) into 



i\n)=£\0) + ^{£^ik)-£'^ik-l)}, 



k=l 
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where the n terms in the sum are i.i.d. random variables whose common distribution 
is that of £^{1) under Pq- Using Bhattacharya's local central limit theorem (see The- 
orem 1.5 in Bhattacharya 1977) and the boundedness assumption on Eo[|^^(l)|^], we 
can control the deviation of p^{x,n) from the corresponding probabilities for Brownian 
motion up to an error of order o(n^^) independent of L. Following Cox and Durrett's 
arguments, we obtain the desired results for integer times. For arbitrary times t, the 
Markov property applied to £^ at time [t\ (plus, for (d), the fact that the variations of 
£^ are bounded on a time interval [n,n + 1]) completes the proof. □ 

Proof of Lemma ] 5. 3[ To simplify notation, we shall write T{dL) instead of T{dL,£^) in 
the rest of the proof. For every L > 1, x & "^(L) and A > 0, let us define the following 
quantities : 



FL(x,A)=E4exp(-AT((iL))]^ 



Gl(x,A) 



{ei^(t)eB{o,dL)} 



dt 



Applying the strong Markov property to £^ at time T{dL) and using a change of vari- 
ables, we obtain (for any xl) 



Gl{xl, A) = E,^ [e-^^('^^)GL(£^(r(di)), A) 



(62) 



From the results of Lemma E31 we can derive the asymptotic behaviour of Gi(xi, A). 
To this end, let {vl)l>i and {ul)l>i be two sequences growing to infinity such that 
fL(log L)"-"^/^ — )■ and n/,(logL)~^ — )• as L — )• oo. Splitting the integral in the 
definition of Gl{xl, ttTo^l) ^'^^^ pieces, we obtain first by (b) of Lemma [5^ 



dl log L J^^L2 



exp 



At 



L2 log L 
1 



p^{xL,t)dt 



dilogL 

IT 

A 



exp 



VlL2 

logL 



Xt 



^ a + 5r 1 dt 



L'^logLJ L2 
)(l+'^L,l) = ^(l + 5l,l) 



as L — oo, where ^ 



uniformly in x G T(L). By (a) of Lemma 15.4^ we have 



1 



exp 



Xt 



L2 log L 



P^{xL,t) dt < 



1 



Old 



4logL [L^SL 
VlogL 



'2 



0, as L 



oo 
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by our assumption on vl ■ By (c) of Lemma [5 

fSLL^ 



1 



d^logL JuL{i+\^L\^vdl) 



exp 



L' 



Xt 



L2 log L 



p^{xL, t) dt 



4logL JuLdMxL^wdl) '^(^It 



2allogL 

1 -/3 V7 



(1 + 5l,2) dt 

2 log L - log(l + V 4) + log EL - log ) (1 + ^^,2) 

(1 + <5l,2)' 



whenever ^"f^g^^^^ f3 as L grows to infinity. Here again, Sl,2,S'l2 ~^ ^ uniformly in 
X € T(L) as L — )• 00. Finally, by (d) of Lemma 15.41 we can write 



4logL Jq 



«L(l+|a;Lpv4) 



exp 



At 



L2 log L 



< 



C2 ul{1 + \xl\'^V dl) 
dilogL i + df\xL\^ 



independently of {xl)l>i since does not vanish and tiL(logL) ^ — )• 0. 
Combining the above, we obtain that if ^"f^'^"^' /3, then 



djlogL 



A 



L2 log L 



A + + "^')' 



as L 



CXO, 



where the remainder does not depend on {xl)l>i. Coming back to (j62p with xl E 
r(L, 1), the uniform convergence obtained above, together with the fact that i^{di) G 
B(0, di) a.s. yield 

\7Ta^T{dL) W {l-^)/{a^X) 1 



hm E, 

L— >oo 



exp 



(1 -7)L2 logL 



(1-7)/(^2^) + (1-7)/<t2 1 + A' 



(63) 



which we recognize as the Laplace transform of an Exp(l)-random variable. Since the 
left-hand side of (f63|) is monotone in A and the function A 1— )• (1 + A)~^ is continuous, 
this convergence is in fact uniform in A > 0. By standard approximation arguments (see 
for instance the proof of Theorem 4 in Cox 1989), we obtain that for any fixed t > 0, 

Avra^ 



lim sup 



XL 



(1 -7)L2 logL 



T{dL) > t 



0, 



and, by monotonicity and the fact that all the quantities involved tend to as t —t- 00, 
this convergence is uniform in t > 0. The interested reader will find all the missing 
details in the appendix of Cox &; Durrett (2002). □ 
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Proof of Lemma \5. 51 ' Let x G T(L) and {U'^)Le'N be as in the statement of Lemma 15.51 
Using the strong Markov property at time T{R,£^), we can write 



> 



F,[T{R,£^) e ds,£^{s) e dy]Fy[r{U'L + UL - s) e B{0,R)]. (64) 

U'j^-UL JB{0,R) 



Note that, on the right-hand side of the quantity U'^ + ul — s hes in [ul, 2ul\- We 
assumed that 2ul < L^(log L)^^/^, and so we can use (c) of Lemma [5^ with di = R 
and write 



hm sup sup 

L^oo yi^B{0,R) UL<t<2uL 



R^ 



\[r{t) G B{0,R)] -1 



0, 



which gives us the existence of a constant Co and of an index Lq such that for each 
L> Lo,y £ B{0,R) and t G [nL,2'UL], 

P,[^^(t)G5(0,i?)]>^>^. 

Furthermore, since UlL^'^ — )• cx), we can use (6) of Lemma [53] to obtain the existence of 
Li G N and a constant Ci > depending only on (C/l)l>i such that for every L > Li, 



sup sup 

t>UL yeT{L) 



Fy[£'^{t + UL)eB{0,R)] ""^^ 



L2 

Using these two inequahties in ([6l]) . we have for L large enough and for all x G T(L) 



^i±^ > F4T{R,e^) G [f/i - n,,U',]] x ^, 
which gives us the desired result. □ 

B Proof of the technical points of Section [6] 

Proof of Lemma 1 6. 5[ Let us start with the case pi = 0{il)\) as L — t- oo. The rate 
of decay of the probability of a long excursion is known for simple random walks and 
Brownian motion (see Ridler-Rowe 1966), and so the proof of Proposition 16.21 suggests 
that we should consider the process £^ = ipj^^ X^{pi-). But here is not a Levy 
process, since is the difference of the locations of two lineages whose motions are 
not independent in S(0,2i?^^i). However, it is not difficult to convince oneself that 
for each y G B{^,{1 /A)R^Y, the return time into 5(0, (3/2)i?^) of starting at y 
is smaller than or equal to that of defined as the rescaled process ip~[^C^{pL') also 
starting at y. Indeed, the rate at which reproduction events affect at least one of the 
lineages is bounded from below by the rate at which a single lineage is affected, the 
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distribution of the jumps of and are identical outside B{0,2R^) and inside this 
ball, coalescence events make it easier for to enter B{0, (3/2)i?^). Hence, we shall 
establish the desired bound for In addition, we shall consider that £^ evolves on 
instead of T(L), since the return time here can only increase with the available space. 

For each L G N, set (Jq = and let {a^)i^f^ be the sequence of jump times of Let 
PlOs (resp., 6b) be the jump rate of £^ due to small events (resp., due to large events). 
The quantities Og and do not depend on L since fi^, fi'^ and the probability measures 
i^r'' do not. For each t > 0, we have l^{t) = ^^(0) + Ei:ai<t {^^('^Z^) where 
{(■^{(Ti') — f-^ i^t-i)) ^ sequence of i.i.d. random variables with covariance matrix 
of the form f^Id. Using the distribution of a single small or large jump and the fact 
that a given jump is a small one with probability 6sPl/{GsPl + 0b), we easily check 
that there exists V > 0, independent of L, such that vl V/ pL s^s L oo (recall our 
assumption pL = C(V'i))- 

Let X in B(0,4R^) \ B{0, {7/4)R^) and let be a two-dimensional Brownian mo- 
tion starting at x. For each L G N, by the Skorohod Embedding Theorem (see, e.g., 
Billingsley 1995) one can construct a sequence (sf )jgN of stopping times such that the 
W{s^) have the same joint distributions as the ^^(o-f) : for every i G N, conditionally 
on W{s^_i), is the first time greater than sf_^ at which W leaves B (W {sf'_-^) , rf") , 
where is a random variable independent of W and of {sj',j < i} having the same 
distribution as the length of the first jump of Now, we claim that there exists 
7 > independent of L and x, such that each time W visits B{0, /2) and then leaves 
B{0, (3/2) i?^), the probability that one of the 's falls into the corresponding period of 
time that W spends within B{Q, {S/2)R^) is at least 7. Indeed, set To{W) = fo{W) = 
and define the sequences of stopping times {Tk{W),k > 1} and {Tk{W),k > 1} by in- 
duction in the following manner: 

n{W)=mi{t>n-i(W) : Wit) G B{0,R''/2)}, 
fk{W) = mf {t > Tk{W) : W{t) i B{0, (3/2)i?^)}. 

(Note that each Tj. is a.s. finite due to the recurrence of two-dimensional Brownian 
motion.) Then for each A; G N, if j is the index of the last sf before Tk{W) and sj" 
corresponds to a small event, by construction we have \W{sf)-W{Tk{W))\ < 2i?*VZ^ 
and so W{sj) G -8(0, (3/2).R^) for L large enough. If sj" is due to a large event 
and W{sf) i -8(0, (3/2)i2^), then necessarily W{s]) G -8(0, {b/2)R^). But the exit 
point from a ball B of Brownian motion started at the centre of this ball is uniformly 
distributed over the boundary of -B, and so one can define 7 as the minimum over (y, r) 
with \y\ > 3R^/2 and \y\ - R^ /2 < r < 2R^ of the probability that W started at 
y escapes B{y,r), through the part of its boundary which lies within -8(0, (3/2)-R^). 
Hence, if we define for each t > the random variable N(t) as the maximal integer k 
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such that Tk{W) < t, we can write for each L 

P^,x[qf>PLu] =P4^^(a^^)^i?(0,(3/2)i?^), yj<i{u,L)] < E4(1 - 7)^(^^(".^))] , 

where i{u,L) = max{j : aj < u}. Since is a.s. a non-decreasing function of t, we 
have for any given m G M+ 

Pv^.x [qf > PLu] < E,. [(1 - 7)^("^")] + P. < H • (65) 

Now, i{u,L) is the number of points of the Poisson point processes n£ and n£ which 
fall into the time interval [0,iip2,] on the original timescale, it is therefore a Poisson 
random variable with parameter u{9sPl + ^_b)- If o > 0, then by the Markov inequality 

^x[i{u,L) < auOsPL] < e"''^^''iK[e-*("'^)] = exp {u9sPL{a + e'^ - 1) +M0B(e-^ - 1)}, 

so that this quantity converges exponentially fast to for a > small enough. On the 
event {i{u,L) > auOsPi}: ^ii^uL) snvn of at least auOsPi i-i-d. random variables, 

each of which corresponds to the exit time of Brownian motion from a ball of radius 
at most 2R'^ with probability OsPl/ {^sPl + Gb) and to the exit time of Brownian 
motion from a ball of radius at most 2R^ otherwise. Therefore, one can find > 
independent of L such that ]E[sf] ~ ^ Pj} as L — t- oo. Using the same technique as 
above then gives us that for m > small enough, there exists K{m) > and L[m) G N 
such that for all L > L{m) and u >0, 

F,[i{u,L) > auOsPL, < mu] < e--(-)''^-. 

Let us now prove that 

lo£ lo£ U 

FJN(mu) < K log log d < C f ^ 

logn 

for a constant C > independent of x G 5(0, 4i?^) \ B{0,{7/4:)R^) and u large 
enough (again independently of x). The reasoning is identical to that made to ar- 
rive at (p6|) . with (resp., ) replaced by Ti{W) (resp., Ti{W)). Using the fact that 
Ci = sup^(,B(o,RB /2)^x[Ti{W)] < oo and 

sup F,[Ti{W) > u] < (66) 
j/eS(o,4RS) logn 

for a constant C2 and u large enough (see Theorem 2 in Ridler-Rowe 1966), we can 
conclude that for each x G B{0,AR^) \ B{0, (7/4)i?^), and u large enough, 

JO, r.rf 1 n . 2Ci(loglogu)2 C2loglogn C'loglogu 
¥^[N{mu) < log log nj < h 7 — < 



mu log (mu/(2 log log n)) logn 
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again for C" > and u large enough independently of x. Coming back to ()65p . we 
obtain for a constant C" > and for all x G B{0, iR^) \ B{0, {7/4)R^), 



'i^LX [qf > PLu] < (1 - 7)i°gi°s« + [N{mu) < log log u] + e" 



< (1 _ ^)iogiog« + ^'^Qg^Qg^ + e-c^"«. (67) 
log u 

Define g{u) as the expression on the right-hand side of (j67p to obtain the result. 

When V'iPi^ ~^ 0, the probability that a large event occurs by time uipf^ is given by 



1 - expi - 9bu—\ as L 
I. Or J 



oo. 

PL ' 

On the event that no large events occur by time wipf^, the first visit of W into B{0, /2) 
will produce a time such that Ty(sf ) G -6(0, (3/2)/?-^) with probability 1 for the 
reason expounded above, and so the first term on the right-hand side of (|65p is now the 
probability that Ti{W) is greater than mu. The inequality in (|66p and the exponential 
decay of P2^[s^^^^ < mu] now imply the result. □ 

Proof of Lemma 1 6. 6[ The arguments are slightly different according to whether Pl'^J^'^ 
is bounded or tends to infinity as L — )• oo. Let us consider the first case. Recall the 
definition of pJ}Ob given in the proof of Lemma 16.51 as the maximal rate at which 
a lineage is affected by a large event. The coalescence rate of two lineages is then 
bounded by Ip^^Os, regardless of their locations. By our assumption there exist 
r G (0, R^) and 5 > such that Leb({r' G [r, r + 6] : u^, ^ {5q, 6i}]) > 0. We shall use 
these events to send the two lineages at distance at least {7 /4)R^'ijjL from each other, 
whatever their initial separation was. The proof is quite natural, so we just give the main 
arguments. If only large jumps occurred, then if a sequence of at least 7i?^/(2r) large 
events increased \X^\ by at least {r/2)'ipL each before the first coalescence happened, 
starting within B{0, {3/2)R^iI;l) would certainly leave -6(0, {1 / 4)R^ ipi)- Moreover, 
a large event affecting and conditioned on not leading to a coalescence biases the 
jump towards increasing \X^\ (we do not allow some centres that are too close to both 
lineages) . This remark and ()32p guarantee that the rate at which these separating events 
occur (that is, events increasing \X^\ by at least {r/2)'tpL) is bounded from below by 
Pj}(^sepi where O^ep is a positive constant. The total rate at which large events affect X^ 
is bounded by 2p2^6b, and so there is a positive probability Psep, independent of the 
starting point of X^, that X^ leaves -6(0, {7/4:)R^iIjl) before coming back to through 
a (large) coalescence event. As regards the effect of small events, recall that we assumed 
that /OlV'l^ bounded. Hence, the probability that X^ starting from -6(0, ripL/2)'^ does 
not enter B{0,2R^) after a time of order 0{pl) only through small jumps is bounded 
from below and by the symmetry of these small jumps, with probability at least 1/2 the 
radius of X^ increases between two large jumps. Hence, up to modifying psep to take into 
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account the effect of the small jumps, the probability that leaves -6(0, {7 / ^)R^ ipi) 
before coming back to is still bounded from below by psep > 0. 

By the definition of and Assumption (|lUp . large events of size close to occur 
at a positive rate and lead to the coalescence of the lineages with positive probability, so 
that the waiting time for the coalescence of two lineages at distance at most {7 /A)R^il)L 
is bounded by pi times an exponential with positive parameter 7. This gives us that 
pJ^Qi is stochastically bounded by Yl\=i where k is geometric with success prob- 
ability psep > and {Ni^i G N} is a sequence of i.i.d. Exp(7) random variables, all of 
them independent of the initial value x S -6(0, {'i/2)R^ ipi) of . We can therefore 
choose Cq = (7Psep)""^- 

When P~[}'4''l ~^ 0, if we use the same reasoning as above there is a positive proba- 
bility that a large event separates the two lineages at distance at least rili^, regardless of 
their separation just before this event. In addition, the rate of these separating events is 
at least equal to pJ^Osep > 0. Between two large events, X^ only does small jumps, and 
as long as X^ ^ -6(0, 2R^), the Skorohod Embedding Theorem (see the proof of Lemma 
[63|) enables us to assert that X^ will leave -6(0, {7/A)R^iPl) in a time of order 0{^j^). 
Moreover, for e > small, the same argument shows that the probability that X^ 
leaves -6(0, {7 /^R^ipL) before entering B{Q,eipL) is bounded from below by a constant 
Pesc > independent of L and of the value y G B{0,ripLy of X^ just after the large 
jump described above. A fortiori, pesc is also a lower bound for the probability that 
X^ started at y leaves -6(0, {7/4)R^iPl) before entering B[0,2R'^) only through small 
jumps, and so we obtain that between two large events such that the first large jump 
sends (or keeps) X^ out of -6(0, tV'l), X^ escapes -6(0, (7/4)-R^Vl) with probability at 
least Pesc (recall that the total rate of large events affecting at least one of the lineages 
is bounded by 29bp2^ ^^'^ ^ V'i)- Consequently, is this time stochastically 
bounded by 'Y^=i^i{^)i where /c is a geometric random variable with success proba- 
bility Pesc > and for each L G N, {Ni{L),i G N} is a sequence of i.i.d. Exp{p2^0sep) 
random variables, all of them independent of the initial value x G -6(0, {2>/2)R^iIjl) of 
X^ . The desired result follows, with Cq = {OsepPesc)~^ ■ D 

Proof of Lemma 6.1: The inequality in (j33p can be restated as in (j34p (the quantity 
inside the brackets then tends to 1), so we prove both inequalities using this form. 
Let 9c be such that Pi^Oc is the minimum rate at which two lineages at distance at 
most (1 -|- 6)R^ipL from each other coalesce (where (5 > is defined at the beginning 
of the proof of Lemma l6.6p . By the definition of R^ and assumption (|10p . the rate at 
which a reproduction event of radius r G [-^■^(l — j)tpL, R^^^l] occurs and leads to the 
coalescence of the lineages does not vanish as L tends to infinity (when multiplied by 
Pl), and so 6c > 0. Let us show that if r/ > is small enough, the probability that X^ 
starting within -6(0, R^tpi) does not leave -6(0, {1 + 6)R^iPl) through only small jumps 
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by time r;^£ is bounded from below by a positive constant, independent of L large. The 
term inside the brackets in (j34p will then come from the probability that a large event 
occurs before time r/?/;£ and the first such event leads to the coalescence of the lineages 
(i.e., a jump onto for X^). 

Let r/ > and x G B{0, R^iPl), and let denote the epoch of the first large event 
affecting X^. By the argument given above, the probability that X^ starting at x hits 
before leaving -6(0, (1 + 5)R^'il>i,) is bounded from below by the probability that X^ 
started at x stays within this ball until r^, Tg is less than or equal to "qip^ and the first 
large event leads to the coalescence of the lineages. Writing Si^ri for the event that X^ 
stays within -6(0, (1 + 6)R^iPl) before and < irtpj^, this probability is equal to 

Fx [the first large event is a coalescence | £L,ri] Fx [SL,ri] ■ (68) 

If, for each L G N, Pi}E < oo denotes the rate at which a single lineage on T(L) 
is affected by a large reproduction event, then the rate at which at least one of two 
lineages are affected is bounded by twice this quantity, and so the first probability in 
()68p is bounded from below by 9c/{2E). Now, X^ experiences no large reproduction 
event before time r^, and so we can again use the equality in distribution stated in the 
proof of Proposition 16.41 (6) (we also keep the notation introduced there). Write texit 
for the first time X^ leaves -6(0, (1 + 6)R^iPl), and toxit for the corresponding time for 
X^ (which sees only small events). We have 

Fx[£l,v] =IP4*exit >rj^; rj^< #i] 
= Fx[ie.it > e(X^); e(l^) < 
>IP4*~cxit > e{X^) < t]iIjI] 

= Fx [e{X^) < r]i^l I 4xit > f?^i] Fx [texit > vi^l] ■ (69) 

Since a pair of lineages is affected by a large event at rate at least pJ}E^ the first 
probability on the right-hand side of ()69p is bounded below for all x G -6(0, R^tPl) by 

1 — exp < — r]E—^ 



PL 

Now, if X^ starts within -6(0, -R^^/^/,), it needs to cover a distance of at least SR^iIj^ 
to exit -6(0, (1 + 5)R^ij)L)- Furthermore, coalescence events tend to keep X^ within 
-6(0,(1 + 6)R^iPl)-, and so for each x, the second probability on the right-hand side 
of (f69|) is larger than Po[texit > V^f}^ where texit is the exit time from B{0,SR^ipi) 
of the process {i^{2t),t > 0} which experiences only small jumps. Decomposing this 
Levy process into the sum of its jumps and applying Doob's maximal inequality to the 
submartingale we obtain 



sup |e'^(2t)|^ > {dR'^^l^L 



0<t<rjtpl/2 



< 
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where the last equahty comes from (|22p . Choosing 77 > small enough so that the 
quantity above is less than 1, we obtain that for all x G -6(0, R^iI^l) 



Px[texit > > Po[4xit > r/Vi] > 1 



04 > 0. 



□ 



Combining the above and choosing 62 = 0^6^/ {2E) and ^3 = r]E, we obtain ([2 

Proof of Lemma \6.y[ If we were considering the times Tjj rather than t*^ , Lemma 
would follow from the same arguments as in Cox & Griffeath (1986) (see Lemma 1). 
Here, we have to work a bit harder and decompose the event in (j44p into more cases. 
Recall the definition of wl given in the statement of Theorem 13.31 For each L G N, the 
probability in (j44p is bounded by 



r < 



Wl 



'12 I 



"^"12; 



VlogL 
Wl 



T > 
<T = 



Wl 



T12 <T 12 



Wl 



ViogX 

Wl 



<T = ri2; T12 > T12 



(logL)2 
Wl 



(logL)- 



r,3i e {l,2},ri3 e (ri2,Ti2] 



<r = n2;ViG {l,2},r,3 >Tr2;|ef(r* 



< 



(70) 
(71) 
(72) 
(73) 



Suppose first that pL ^ ipilogL. The first term in ()70p is bounded by the sum over 
i / j G {1, . . . ,4}2 of Pyi^[Tjj < Ci72,(log L)~-'^/2]^ which tends to uniformly in Al by 
Proposition 16.21 and the consistency of the genealogy. The quantity in (j7ip . expressing 
the probability that the first pair to meet is the pair (1,2) but then coalescence of these 
lineages takes longer than wl / {log L)'^ units of time, is therefore bounded by 

Wl 



Pa, 



'12 



T12 > 



(logL)2. 

which converges to zero as L — )■ c«, uniformly in Al (apply the strong Markov property 
at time T12 and use (a) of Proposition 16. 4p . The expression in (j72p corresponds to the 
event in which (1,2) is the first pair to meet and "quickly" merge, but another pair of 
lineages manages to meet between T12 and rj*2- 1^ is thus bounded by 



Wl 
VlogL 

+Pa, 



< T = T12 ; Ti3 S ri2,Ti2 + 



Wl 



Wl 



< T 



(logL)2 

T12 ; T23 G (ti2,Ti2 + 



Wl 



Applying the strong Markov property at time T12 and using Lemma [5.5l with {£^{t))t>o = 
(V'Z^i^^ ~ ^3 }((V'i A PL)t))t>o for each i G {1, 2} (as in the proof of TheoremES]), we 
can conclude that each of the above terms tends to uniformly in Al- On the event 
described by (|73]l . that is (1, 2) is the first pair to meet and merge, no other pair meets 
in between but the distance between lineages 1 and 3 at time r* is smaller than L/ log L, 
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the differences {^f — ^2 } {^1 ~ ^3 } have the same distribution as two independent 
copies and of the process run at speed 2 up until r, and so if we write Tl 
(resp., Tl) for the entrance time of (resp., into B{0, 2R^tPl), with a shght abuse 
of notation for the initial value to simplify the notation, (j73p is bounded by 



Tl>Tl> 



L 



logL 



Tl > 



VOL 



\/logX 



;|e''(TL)| < 



logL 



A straightforward application of Lemma l5.4l (fe) with (^^(t))t>o = 0^2^ (iPL^i^L)i))t>o 
yields the uniform convergence of the last term to 0. Finally, the second term in (j70p . 
i.e., the probability that (1, 2) is the first pair to meet but not to merge, is bounded by 
the sum over all pairs {i,j} G {1, . . . , 4}^ such that i j and {i,j} / {1, 2} of 



r > 



Wl 



\/logX 



; T 



< 



ZUl 



\/logX 

— T-* > 



< r = Tij < 



Wl 



Wl 



ViogX 



; Tij G 



(logL)2' 



'12 



(logL) 



T* 

) ^12 



We can now conclude as we did for ()7ip and ()72p . 

When PL ^ "^L^ogL, we saw in the proof of Theorem 16.31 that with probability 
increasing to 1 , a pair of lineages will not be affected by a large event during the periods 
of time when the lineages are at distance less than 2R^ij)L from each other, until they 
come at distance less than 2i?*. Consequently, we could consider the evolution of the 
lineages to be independent until their gathering time at distance 2W . Because we are 
still considering a finite number of lineages, the arguments we used are applicable here 
again, and the proof of the last paragraph also yields ()44p in this case. The proof of 
5]) is analogous, and is therefore omitted. □ 
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